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Mean-field driven first-order phase transitions 
in systems witli long-range interactions 

Marek Biskup,^ Lincoln Chayes,' and Nicholas Crawford^ 



We consider a class of spin systems on Z'^ with vector valued spins {Sx) that in- 
teract via the pair-potentials J^^y Sx ■ Sy. The interactions are generally spread- 
out in the sense that the Jx.y's exhibit either exponential or power-law fall-off. 
Under the technical condition of reflection positivity and for sufficiently spread 
out interactions, we prove that the model exhibits a first-order phase transition 
whenever the associated mean-field theory signals such a transition. As a con- 
sequence, e.g., in dimensions d > 3, we can finally provide examples of the 
3-state Potts model with spread-out, exponentially decaying interactions, which 
undergoes a first-order phase transition as the temperature varies. Similar transi- 
tions are established in dimensions d — 1,2 for power-law decaying interactions 
and in high dimensions for next-nearest neighbor couplings. In addition, we also 
investigate the limit of infinitely spread-out interactions. Specifically, we show 
that once the mean-field theory is in a unique "state," then in any sequence of 
translation-invariant Gibbs states various observables converge to their mean- 
field values and the states themselves converge to a product measure. 

Key Words: First-order phase transitions, mean-field theory, infrared bounds, 
reflection positivity, mean-field bounds. Potts model, Blume-Capel model. 
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1. INTRODUCTION 
1.1. Motivation 

The understanding of the quantitative aspects of phase transitions is one of 
the basic problems encountered in physical (and other) sciences. Most of the 
existing mathematical approaches are based on the use of contour expansions 
via Pirogov-Sinai theory [41,42, 50] and/or the use of correlation inequali- 
ties [21,44,45]. Notwithstanding, many "practical" scientists still rely on the 
so-called mean-field theory which, in its systematic form, goes back to the 
work of Landau. From the perspective of mathematical physics, it is therefore 
desirable to shed as much light as possible on various mean-field theories and, 
in particular, attempt to place the subject on an entirely rigorous basis. 

In a recent paper [11], two of us have established a direct connection be- 
tween temperature-driven first-order phase transitions in certain ferromagnetic 
nearest-neighbor spin systems on and their mean-field counterparts. The 
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principal result of Ref. [11] states that, once the mean-field theory signals a 
first-order phase transition, the actual system has a similar transition provided 
the dimension d is sufficiently large and/or the mean-field transition is suffi- 
ciently strong. Moreover, the transition happens for the values of parameters 
that are appropriately "near" the mean-field transitional values; indeed, the var- 
ious error terms tend to zero as d — > oo. 

The principal goal of the present paper is two-fold. First, we will con- 
siderably extend the scope of systems to which the ideas of Ref. [11] apply; 
i.e., we will prove discontinuous phase transitions in systems which heretofore 
have been beyond the reach of rigorous methods. Second, we will in a general 
way expound on the mean-field philosophy. In particular, we will demonstrate 
that mean-field theory provides an asymptotic description of a certain class of 
systems regardless of the nature of their transitions. 

Our approach is somewhat akin to the bulk of work on the so-called Kac 
limit of lattice [14-17] as well as continuum [30,36,37] systems. Here one 
considers finite-range interactions of unit total strength which are smeared out 
over a region of scale V7. As 7 tends to zero, each individual site interacts with 
larger and larger number of other sites and so, for 7 <C 1, one is in the position 
to prove that the characteristics of an actual system (e.g., the magnetization) 
are close to those of the corresponding mean-field theory. In particular, all 
"approximations" (i.e., upper and lower bounds) become exact as 7 J, 0. 

Notwithstanding, the similarity between the Kac limit and our approach 
ends with the above statements: Our technique involves tight bounds on the 
fluctuations of the effective field while the analyses of Refs. [14-17] are based 
on coarse-graining arguments. As a consequence, we have no difficulty treating 
models with complicated single-spin spaces — even those exhibiting continuous 
internal symmetries or leading to power-law decay of correlations — or nearest- 
neighbor systems in large dimensions. Of course, there is a price to pay: Our 
technique requires the infrared bound on two-point correlation function which 
is presently available only for models obeying the condition of reflection posi- 
tivity. Moreover, unless we assume power-law decaying interactions, the use of 
infrared bounds does not permit any statements ind — 2, while the Kac-limit 
approach works equally well in all d > 2. 

1.2. Models of Interest 

For the duration of the paper, as in Ref. [1 1], we will focus on spin models with 
two body interactions as described by the formal Hamiltonian 

= J] J,,y (5,, Sy) - ^(/i, 5,). (1.1) 

{x,y) ^ 
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The various objects on the right-hand side are as follows: /3 is the inverse tem- 
perature, {x, y) denotes an unordered pair of distinct sites, J.j..y (= Jy^^) is the 
coupling constant associated with this pair, the spins Sx take values in a com- 
pact set O C M", the (reduced) external field /i is a vector from M" and (■, ■) 
denotes some inner product in M". Implicit in the notation is an underlying a 
priori measure on Q which represents the behavior of the spins in the absence 
of interactions. (In principle, the term which describes the coupling to the ex- 
ternal field, namely the {h, S^Y^, could be absorbed into the definition of the a 
priori measure. However, for aesthetic reasons, here we will often retain these 
terms as part of the interaction.) 

Mean-field behavior is typically anticipated in situations where fluctua- 
tions are insignificant and, on general grounds, one expects this to be the case 
in high dimensions. These were precisely the operating conditions of Ref. [11] 
(as well as of Refs. [13,34]) where, in a mathematically precise sense, the 
stipulation concerning the fluctuations was vindicated. However, an alternative 
route for ramping down fluctuations is to consider "spread out" interactions, 
i.e., Jx,y's which do not go to zero too quickly. As alluded to earlier, this alter- 
native is, in fact, the common starting point for modem mathematical studies 
of phase transitions based on mean-field theory, e.g., Refs. [14-17,36] and 
Refs. [26-29,43]. 

Unfortunately, we do not have complete flexibility as to how we can 
spread out our interactions. Indeed, our principal error estimate requires that 

the {Jx,y) satisfy the condition of reflection positivity (RP). Notwithstanding, 
the following three classes of interactions are available to our methods: 

(1) Nearest along with next-nearest neighbor couplings, i.e., potentials 
such that Jx,y = A if ,T and y are nearest neighbors, J, „ = 
with A > 2 (rf — 1 ) I K, I if ,T and y are next- nearest neighbors and Jx,y = 
in the remaining cases. 

(2) Yukawa-type potentials of the form 



T _ p.-ti\x-y\i 
'-'x,y ^ ) 



(1.2) 



where > and \x — y\i is the £^-distance between x and y. 



(3) Power-law decaying interactions of the specific form 



1 



(1.3) 



X - y\\ 



s ' 



with s > 0. 



Aside from these "pure" interactions, reflection positivity holds for 
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(4) any combination of the above with positive coefficients. 

The derivation of the reflection-positivity property for these interactions goes 
back to the classic references on the subject [22-24]; for reader's convenience 
we will provide additional details in Sect. 3.1 and Sect. 4 (Remark 4.5). 

We note that for all positive values of s the interactions listed in item (3) 
are indeed, in the technical sense, reflection positive. However, some values 
of s are not viable and others are not particularly useful. Specifically, if s < d, 
then the interaction is attractive and non-summable so there is no thermody- 
namics. Thus we may as well assume that s > d. Furthermore, if d — 1 and 
s > 2 or d = 2 and s > 4 then our methods break down. With some reason: 
In the one dimensional cases with s > 2, the results of Refs. [3, 19, 20, 39,47] 
indicate (and in specific cases prove) that no magnetic ordering is possible. 
Similarly, in the above mentioned two-dimensional cases, magnetic ordering is 
precluded in many systems. 

To summarize, we will impose the following limitations on our power-law 
interactions in Eq. (1.3): 

(a) s < 2 in d = 1, 

(b) s < 4 in = 2, 

(c) s > d in all d > 1. 

Although case (1) does not give us any real options for spreading the inter- 
action beyond the previous recourse of taking ^ 1, cases (2) and (3) offer 
us the possibility to do so on a fixed lattice. This is essentially obvious in 
case (2) — just take the parameter small. As for case (3) it is seen, after a lit- 
tle thought, that taking s close to d presents an additional and powerful method 
for smearing interactions. 

1.3. Outline of results 

Given the ability to smear interactions on a fixed lattice, much of the technol- 
ogy developed in Ref. [11] can be applied without the stipulation of "rf suffi- 
ciently large." Thus it will prove possible to make statements about specific 
models on reasonable lattices with (more or less) reasonable interactions. 

One such "specific" model will be the g-state Potts model (see Sect. 2.2). 
Here, for example, we will establish a discontinuous transition between the or- 
dered and disordered states of a 3-state Potts model on Z'^ with interactions de- 
caying to zero exponentially. (And similarly for any other g-state Potts model 
on with q > 3 and d > 3.) Analogous first-order phase transitions are also 
proved in dimensions one and two provided we have power-law decay of the 
couplings as discussed above. For example, in d = 1, for any power-law decay 
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exponent s e (1, 2), we produce couplings such that the 3-state Potts model 
has a first-order transition as the overall strength of the coupling varies. 

As another illustration, we consider the low temperature behavior of the 
Blume-Capel model. The system will be described precisely in Sect. 3.4, for 
now it suffices to say that the spins take values in { — 1,0,+1} with a priori 
equal weights. The zero temperature phase diagram of this model has a triple 
point where the three states of constant spin are degenerate in energy, however, 
as demonstrated in Ref. [46], this degeneracy is broken at finite temperatures 
in favor of the state dominated by the zeros. The previous analyses of this 
phenomenon required rather detailed contour estimates; here we will establish 
similar results by relatively painless methods. 

The techniques at our disposal will allow us to put to rest some small 
controversies which, in recent years, have been topics of some discussion. For 
instance, a conjecture has been made [32,33] which boils down to the statement 
that in any one-dimensional finite-state spin system with arbitrary translation- 
invariant, summable interaction, the set of phase-coexistence points at positive 
temperatures is a subset of the corresponding set at zero temperature. We will 
rule this out by our analysis of the Potts models in an external field. 

In addition to predicting first-order transitions, our mean-field framework 
provides an explicit description of general lattice spin systems in the limit when 
the interactions become highly diffuse. In particular we show that, whenever 
the mean-field theory is in a unique "state," the magnetization and the energy 
density of the actual system converge to their mean-field counterparts. More- 
over, every translation invariant Gibbs state converges to a product (i.i.d.) mea- 
sure with individual-spin distribution self-consistently adjusted to produce the 
correct value of the magnetization. (This vindicates the assumptions typically 
used to "justify" mean-field theory; see Sect. 2.1.) Results in this direction 
have appeared before; cf Refs. [13, 34], but the main difference is that here we 
are not forcing d ^ oo and hence it is possible to envision a limiting system 
towards which we are heading. 

1.4. Organization 

The organization of the remainder of this paper is as follows: In Sect. 2.1 
we describe, in succinct terms, some general aspects of mean-field theory. In 
Sect. 2.2 we discuss the mean-field theory for the Potts model in an external 
field — which is the primary model studied in this work. Precise results con- 
cerning these situations are the subject of Sect. 2.3. 

Sect. 3 is devoted to the statements of our main result. Specifically, in 
Sect. 3.1 we formulate a general theorem (Theorem 3.2) that allows us to prove 
first-order phase transitions in actual lattice models with interaction (1.1) — and 
RP couplings — by comparison to the associated mean-field theory. Sect. 3.2 



Mean-field driven first-order phase transitions 



7 



provides conditions under which the mean-field theory is obtained as a limit of 
lattice systems when the interaction becomes infinitely spread out. Sects. 3.3 
and 3.4 contain precise statements of our theorems concerning the behavior of 
the specific systems we study: The zero-field g-state Potts models with g > 3, 
the same model (with g > 4) in an external field which enhances or supresses — 
depending on the sign — one of the states, and the Blume-Capel model near its 
zero-temperature triple point. Sect. 3.5 mentions some recent conjectures that 
can be addressed using our results. 

The principal subject of Sect. 4 is to give the proof of our general results 
(Theorems 3.2 and 3.3). As part of the proof, we will discuss certain interest- 
ing convexity bounds (Sect. 4.1), reflection positivity (Sect. 4.2) and infrared 
bounds (Sect. 4.3). In Sect. 4.5 we show how the specific interactions listed 
in Sect. 1.2 fit into our general scheme. Sect. 5 is devoted to the mathematical 
details of the mean-field theories for all the above mentioned models; in par- 
ticular the proofs of all claims made in Sect. 2.3. Sect. 6 then assembles all 
ingredients into the proofs for actual lattice systems. 

2. MEAN-FIELD THEORY AND THE POTTS MODEL 

Here we shall recall to mind a formalism underlying (our version of) mean- 
field theory and provide heuristic discussion of the basic facts. The specifics 

will be demonstrated on an example of the g-state Potts model in an external 
field; first somewhat informally in Sect. 2.2 and then precisely in Sect. 2.3. 

2.1. Mean-field heuristic 

We will focus on the situations described by the Hamiltonian in Eq. (1.1). Of 
course the real models must be carefully defined on Z'' as limits of finite vol- 
ume measures corresponding to this Hamiltonian at inverse temperature P and 
some sort of boundary conditions. We shall assume the reader is familiar with 
this basic theory (enough of the relevant formalism can be found in Sect. 3.1) 
and skip right to the consideration of an infinite-volume translation-invariant 
Gibbs state np^h corresponding to the Hamiltonian in Eq. (1.1) and inverse 
temperature p. For convience we will assume here, as in the rest of this paper, 

Jx,x^O, ^|Jo,^|<oo and J^^y = 1. (2.1) 

We will let denote the expectation with respect fig ^ and Eq expectation 
with respect to the a priori (product) measure /^o- (We will of course assume 
in the following that /xq is supported on more than one point.) 

The principal idea is to study the distribution of one spin variable, e.g., the 
one at the origin of coordinates. Let m denote the expected value of this spin. 
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m — Ep^h{So). Then, conditioning on the configuration in the complement of 
the origin, we get the identity 

where mo is the random variable given by the weighted average 

mo = Jo,x S^. (2.3) 

We emphasize that the expectation "acts" only on mo while Eo "acts" 
only on the auxiliary spin variable S. 

When all is said and done, the underlying assumption behind the standard 
mean-field theories boils down to the statement that the quantity mo is non- 
random, and therefore equal to m. Postponing, momentarily, any discussion 
that concerns the validity of such an assumption, the immediate relevance is 
that in Eq. (2.2) we can replace mo by m which in turn makes the outer expec- 
tation on the right-hand side redundant. We thus arrive at the self-consistency 
constraint 

Eo(5e(^''^'^+'^)) 

which is the mean-field equation for the magnetization. Clearly, if it can be 
established that the fluctuations of mo are negligible, then the actual magneti- 
zation must be near a solution of Eq. (2.4). 

In this light, our results are not that hard to understand: In most instances 
where the mean-field theory predicts a discontinuous transition this prediction 
is showcased by the fact that Eq. (2.4) simply does not admit continuous solu- 
tions. Thus if the error caused in the approximation mo m is much smaller 
than the discontinuities predicted in the mean-field approximation, jumps of 
the physical magnetization cannot be avoided. 

As all of the above is predicated on the near constancy of the random 
variable mo, let us turn to a discussion of the fluctuations of this quantity. An 
easy calculation shows that 

Var(mo) = ^ Jo,xJo,y^fS,h{{Sx: Sy) - \m\^) (2.5) 

where |mp = (m, m). The quantity E,Q^h{{S,j;, Sy) — |mp) is the thermal 
two-point correlation function which, on general grounds, may be presumed to 
tend to zero at large separations. It would thus seem that the stipulation of a 
"spread out interaction" along with any sort of decay estimate on the two-point 
correlations would allow us to conclude that the variance of mo is indeed small. 
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However, while explanations of this sort are satisfactory at a heuristic level, 
a second glance at Eq. (2.5) indicates that the task is not necessarily trivial. 
Indeed, of actual interest is the decay of correlations within the effective range 
of the interaction, which is guaranteed to be delicate. At the core of this paper 
is the use of reflection positivity to provide these sorts of estimates. 

In many cases, Eq. (2.4) on its own is insufficient for understanding the 
behavior of a system — even at the level of mean-field theory. Specifically, in 
the case of a discontinuous transition, Eq. (2.4) will typically have multiple so- 
lutions the overall structure of which does not allow for a continuous solution. 
While this may have the advantage of signaling the existence of discontinu- 
ities, it does not provide any insight as to where the discontinuities actually oc- 
cur. Thus, whenever there are multiple solutions to Eq. (2.4), a supplementary 
"rule" is needed to determine which of these solutions ought to be selected. 

The supplement — or starting point of the whole theory depending on 
one's perspective — is the introduction of the meanfleld free-energy func- 
tion ^p^h{m) defined as follows: Let S{m) be the entropy function associated 
with the a priori measure on the spins. Formally, this quantity is defined by 
means of the Legendre transform 

S(m) = ini {G(b) -(b,m)] (2.6) 
6eR" «. ^ ' 'J 

of the cumulant generating function 

G'(6)=logEo(e('''^)). (2.7) 

The mean-field free-energy function is then defined as the difference of the 
energy function, E{m) — — | |mp — (h, m), and the entropy S{m): 

^/3,a(m) = -^|m|2 - (h,m) - S{m). (2.8) 

Then, as is not hard to see, the mean-field equation is implied by the condition 
that ^p^h be minimized. Indeed, writing V$/3,/i(m) = some straightforward 
manipulations give us 

m = VG'(/3m + /i), (2.9) 

which is exactly Eq. (2.4). 

Eq. (2.8) along with the stipulation to minimize adds a whole new dimen- 
sion to the theory that was defined by Eq. (2.4). Foremost, in the case of multi- 
ple solutions, we now have a "rule" for the selection of the relevant solutions. 
Beyond this, we have a framework resembling a full thermodynamical theory: 
A free energy — defined by evaluating ^p^h at the minimizing m — along with 
an entropy and energy which are the corresponding functions evaluated at this 
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magnetization. In fact, a secondary goal of this work is to demonstrate that this 
"more complete" mean-field theory provides an asymptotic description of the 
actual theories with spread out interactions. 

Remark 2.1. We conclude this subsection with the remark that the 

mean-field theory for any particular Hamiltonian of the form (1.1) can be 
produced in an actual spin-system by considering the model on the complete 
graph. Explicitly, for a system with sites, we take J^^y = j^, compute all 
quantities according to the standard rules of statistical mechanics and then take 
— > oo. The result of this procedure is the mean-field theory described in 
this subsection for the thermodynamics and a limiting distribution for the spins 
which is i.i.d. The connection between mean-field theory and complete graph 
models is well known and has been proved in numerous special cases (see, 
e.g., Ref. [18] for a recent study of ensemble equivalence for the Potts model 
on the complete graph). A complete proof for the general form of M' given in 
Eq. (1.1) appears e.g. in Sect. 5 of Ref. [11]. 

2.2. Potts models in external field 

The best example of a system which exhibits a rich spectrum of behaviors 
while remaining tractable is the Potts model in an external field. The Potts 
model is typically defined using discrete spin variables e {1, . . . , g} with 
no apparent internal geometry. The energy of a configuration is given by the 
(formal) Hamiltonian 



Here (3 is the inverse temperature, the Jx,y'^ are the coupling constants for the 
system, and S^^^cry is the Kronecker delta. The reduced external field h is related 
to the physical external field hviah — h/p. We have chosen only the state "1" 
as the state affected by the external field even though more general versions are 
also possible [7,9, 10, 12]. 

This system is cast in the form of Eq. (1.1) by using the tetrahedral rep- 
resentation: We take spin variables G {vi, . . . v^}, where the v^'s are the 
vertices of a unit tetrahedron in W"^. Inner products (defined the usual way 
for vectors in R^"^) between the v^'s satisfy 




(2.10) 




if A; = Z, 
otherwise, 



(2.11) 



and so 



5, 



1 



1 




(2.12) 
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After similar consideration of the magnetic field terms, it is seen that the Hamil- 

tonian in Eq. (2.10) is manifestly of the form in Eq. (1.1). To stay in accord 
with the classic references on the subject, e.g., Ref. [48], we will keep the q- 
dependent prefactor suggested by Eq. (2.12). So, our official Hamiltonian for 
the Potts model will read 

^jf ^ -^13 J2 J.,y {Sx, Sy) - i^/i ^(vi, S,) (2.13) 

^ {x,y) ^ 

with the J's obeying Eq. (2.1) and /i G M. 

The mean-field theory is best expressed in terms of the vector magnetiza- 
tion given by 

m = XiYi H h XqYq, (2.14) 

and the mean-field free-energy function is [1 1, 48] 

q n 
k=l 

Here the "barycentric" coordinates Xk are components of a probability vectors, 

i.e., we have > and xi + \- Xq — 1. In the context of the Potts model 

on a complete graph, x^ represents the fraction of sites in the k-ih. spin state. 

Let us start with a recapitulation of the zero-field case where the resulting 
theory is quite well known. For each q there is a number j3^^ such that if 
(3 < /9^p, the unique global minimizer is the "most symmetric state," m = 0, 
while for (3 > /3^p, there are exactly q (asymmetric) global minima which 
are permutations of one probability vector of the form xi > X2 — • ■ ■ — Xq. 
Thus we may express all quantities in terms of a scalar magnetization, e.g., 
xi = ^ m and Xk ^ \ - k = 2, . . . , Then, when /3 > the 

mean-field magnetization is given by mMF(/9) = where 9 is the maximal 
positive solution to the equation 

qPO - 1 
+ g — 1 

The crucial point — which can be gleaned form a perturbative analysis of 
Eq. (2.16) — is the division at q = 2 of two types of behavior. In particu- 
lar, rriMFiP) tends to a strictly positive value as l3 i /S^p for g > 2, while 
for = 2 the limit value is zero. (Indeed, for q — 2, there are no nontrivial 
solutions to Eq. (2.16) at /3 = = 2.) 

Remark 2.2. Interestingly, the values of (3^1 and the limit value 
"^mf(/3mf) explicitly computable: 

/9S = 2^1og(g-l), mMF(/?S) = ^. (2.17) 
g z q 
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This observation goes back to at least Ref. [48]. 

Let us now anticipate, without going to details, what happens for h Q. 
(The full-blown statements and proofs will appear in Sect. 2.3 and Sect. 5, re- 
spectively.) We will capitalize on the principle that local minimizers are stable 
to small changes in parameters. Consider g > 3 and h ^ such that -C 1. 
The overall situation cannot differ too drastically from the zero-field case; the 
only distinction is that for h > only one of the "/i = asymmetric mini- 
mizers" is allowed while for /i < the same minimizer is suppressed in favor 
of the remaining q — 1 ones. On the other hand, for h positive and large, it is 
clear that the minimizer of ^^^\{m) will be unique no matter what (3 is. Thus, 
for /i > we should have a line of mean-field first-order phase transitions 
which terminates at a finite value of h. On general grounds, the terminal point 
is expected to be a critical point. 

Next, let us consider h < with \h\ 3> 1. The situation at /i = — oo is 
clear; this is just the (q — l)-state Potts model. Thus for finite but large we 
can see a clear distinction between q = 3 and g > 3. In the former cases, the 
mean-field transition should be Ising like and hence continuous. In the latter 
case, the transition should be discontinuous. Thus, the g = 3 line should break 
at a tricritical point followed by a line of continuous transitions while for g > 3 
there will be an unbroken line of discontinuous mean-field phase transitions. 

Aside from general interest, the key motivation for obtaining such detailed 
knowledge about ttimf is as follows: Under specific conditions on (1.1), virtu- 
ally all that has just been discussed pertaining to discontinuous transitions in 
these systems can be established with rigor in the spread out "real" systems. 
(On the downside is the fact that virtually nothing pertaining to the continuous 
transition can be proved by these methods.) To illustrate let us consider the 
transition at /i > when q is large. The mean-field picture is as follows: A 
non-convexity of $^^)j(m) develops when j3 is of order unity, but it does not 
"touch down" until j3 is appreciable (of order log q). However, the existence of 
a non-convexity suggests that a strong-enough magnetic field can tilt the bal- 
ance in favor of a magnetized state, even for /5's of order unity. This is indeed 
the case for the MFT as our detailed calculations later show. As a consequence 
of the general techniques presented here, this result from the MFT will be pro- 
cessed into a theorem for actual systems. 

2.3. Precise statements for mean-field Potts model 

Our precise results for the mean-field theory of the Potts model in an external 
field are summarized into two theorems; one for positive fields and the other 
for negative fields. 
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Theorem 2.3. (Positive fields) Letq>3, let m and the probabil- 
ity vector (xi, . . . , Xg) be related as in Eq. (2.14) and let ^^^\{m) denote the 
function from Eq. (2.15). Let denote the quantity 

/ic-logg- (2.18) 
Q 

Then there is a continuous function /3^^^ : (0, he) (0, oo) such that 

(1) For all {j3, h) such that either h > he or /3 ^ fi^\h), there is a unique 

global minimizer of ^^^\{m) with X2 = ■■■ = Xg. The quantity xi 
corresponding to this minimizer is strictly larger than the mutual value 
of the Xk 'sfor k = 2, . . . ,q. 

(2) For all h < he, there are two distinct global minimizers of^^^\{m) at 

(3) For (P, h) such that h > or (3 ^ (3^\h), let Xi — Xi(P, h) denote 
the first coordinate of the global minimizer of ^^p\{ra). Then ((3, h) i— > 
Xi{P, h) is continuous with well-defined but distinct (one-sided) limits 
at {(3, h) = {f3^_^\h), h). Furthermore, writing Xi — ^-\-m, the quantity 
9 = -^rn obeys the equation 

in the region of uniqueness. At the points {(3^\h), h), both limiting 
values obey this equation. 

(4) The function h i-^ f3^\h) is strictly decreasing on (0, /ic) with limit 
values I3f{h) ] pj^l = log(g -l)ashiO and I3^^\h) [ as 
h t he. 

In order to preserve uniformity of exposition, we will restrict the statement 
of negative-field results to g > 4. 

Theorem 2.4. (Negative fields) Let q>A, let m and the probabil- 
ity vector (xi, . . . , Xq) be related as in Eq. (2.14) and let $^^)j(m) denote the 
function from Eq. (2.15). Then we have: 

(1) All global minima are permutations in the last q — 1 variables of vectors 
with the representation 

Xi < X2 — ■ • ■ — Xq-i < Xg. (2.20) 
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Moreover, there exists a function f3^^ : (— oo,0) — > (0,oo) such that the fol- 
lowing hold: 

(2) (Symmetric Minimum) For all j3 < P^\h), there is a unique global 
minimum and it has X2 — ■ ■ ■ — Xq. Moreover, ifm is such that xi = ^ — 
m and x^ = ^ + for all k — 2, . . . ,q, then 9 = corresponds 
to a global minimum when 

^ = (^_i)e/3^-. + i- (2.21) 

There is only one 

e [0, for which Eq. (2.21) holds. 

(3) (Asymmetric Minima) For all f3 > f3^\h), we have q — 1 global minima. 
These are permutations in the last q — 1 variables of a single minimum 
whose coordinate representation takes the form 

Xi < X2 ~ ■ • ■ — Xg-i < Xq. (2.22) 

(4) At P — (3-\h) there are q global minima. One of these is of the type 
described in ( 2 ) — namely, the symmetric minimum — while the other q—1 
are of the type described in (3). 

(5) The function h i— > (3^\h) is strictly increasing and continuous. More- 
over, we have the limits 

lim = 4V and lim I3^^\h) = ^l^l (2.23) 

h-^—oo h'[0 

Theorem 2.3 is proved in Sect. 5.3 and Theorem 2.4 is proved in Sect. 5.4. 
The corresponding statement for the actual lattice systems is the subject of 
Theorem 3.5. 



3. MAIN RESULTS 

Here we give the statements of the principal theorems which apply to any 
model whose interaction is of the type (1.1). Then we apply these to the Potts 
and Blume-Capel models. 



3.1. General theory 

We begin by a precise definition of the class of models we consider. Let Q be a 
compact subset of R", with the inner product denoted by ( • , • ) , and let Conv (Q) 
denote the convex hull of Q. Let //q be a Borel probability measure on (Q, ^) 
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that describes the a priori distribution of the individual spins. We will consider 
spin configurations (S^) from Q^'* and, abusing the notation slightly, use /Iq to 
denote also the corresponding a priori product measure. 

To define the interacting spin system, let us pick a finite set A C Z^, a 
spin configuration 5a G in A and the "boundary condition" S^c e Vt^\ 
For each /i e R" and each /3 > 0, we then define the finite-volume Hamiltonian 
J^(5a, 5ac) by 

/?J^a(5a,5ac) = -/3 Jx,y{S,,Sy)-Y,ih,S,). (3.1) 

{x,y) x£A 

The first sum goes over all unordered pairs of distinct sites {x, y) at least one 
of which is contained in A. 

The above Hamiltonian can now be used to define the finite-volume Gibbs 
measure v'^^^"'' on spin configuration from Vt^ by 

^ '^^^^^ = yiS^U, /^o(d5A), (3.2) 

( S c 1 

where the normalizing constant Z)^ ^ {(3, h) is the partition function. Of par- 
ticular interest are the (weak subsequential) limits of these measures as A ex- 
pands to fill out the entire U^. These measures obey the DLR-conditions [25] 
and are generally referred to as (infinite-volume) Gibbs measures. In this for- 
malism, phase coexistence is said to occur for parameters (3 and h if there is 
more than one limiting Gibbs measure. Under these conditions the system is 
said to exhibit a first-order phase transition. 

We proceed by formulating the precise conditions under which our results 
will be proved. To facilitate our next definition, for each lattice direction £ e 
{1, . . . , li}, let denote the half-space 

= {x = (xi, . . . , Xd) e If, xt > 0}. (3.3) 

We will use 'd^^^ to denote the reflection ^^^^ : — > Z"* \ defined explicitly 
by the formula '&'^^\xi, ...,Xd) = (xi, . . . , xe-i, 1 - xe, xe+i, . . . , xa). 

Definition 3.1. (RP "through bonds") Consider a collection of 
coupling constants {Jx,y)x,yeZ'i- We say that these are RP if the following con- 
ditions hold: 

(1) (translation invariance) for any x,y we have Jx,y = Jo,y-x- 
Moreover, for any lattice direction £ e {1, . . . , d}. 
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(2) (reflection invariance) for any x,y eM^we have 

■Jx,y = J^Wx,^Wy (3.4) 

(3) (reflection positivity) if / : ^ M is absolutely summable with 

E f{x) = 0, (3.5) 

then 

E J^,vf{^)f{^^'^y)>^- (3.6) 

Given a translation-invariant Gibbs measure, we use the word magneti- 
zation to denote the expectation of the spin at the origin. The statement of 
our general result can then be viewed as a restriction on the possible values 
of the magnetization. However, not all magnetizations that can be physically 
produced are (provably) accessible to our methods. The reason is that the un- 
derlying Gibbs states for which our techniques work will have to satisfy the 
conditions of reflection positivity — in particular, they have to be obtained as 
weak limits of torus states. Our next item of business will be to define pre- 
cisely the set of "allowed values" of the magnetization. 

We will proceed as in Ref. [11]. Let Z\{j3, h) be the partition function in 
volume A — the boundary condition is irrelevant — and let F(/?, h.) denote the 
(physical) free energy defined as the limit of — |X| ^og -^a A increases to fill 
the entire (in the sense of van Hove [25]). The function h) is jointly 
concave, so we may let J^i,{P, h) denote the set of all pairs [e,^, such that 

F{(3 + A(3,h + Ah) - F((3, h) <e^A(3+ (m*, Ah) (3.7) 

for any A/3 e R and any Ah e R". Now J^{P, h) is a convex set so we 
let h) to denote the set of values for which there exists an such 

that [e*, rrii] is an extreme value of J^i,{P, h). Our main theorem then reads: 

Theorem 3.2. Consider the spin system on Z'^ with the Hamiltonian 
(1.1) such that the couplings {Jx,y) are RP, the inverse temperature P > and 
external field h E W^. For each k E [— tt, ttY, let J{k) = J^x^zd Jq,x^^^^ 
recall that J(0) = 1 by Eq. (2.1). Then for any E h), 

^p,h{^*)< inf $^,^(m)+/?n^^, (3.8) 

where n is the (underlying) dimension of the spin-space, k = maxsea |>S'p and 

JLJML (39) 
Jl-,.,]'('i''Yl- J(k)' 
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The useful aspect of Theorem 3.2 is that the error term S — /3n^^ can be 
made small by appropriate adjustment of parameters. A general statement of 
this sort appears in Proposition 4.10 but, typically, these conditions have to be 
verified on a case by case basis. Let us tend to the details of these adjustments 
later and, for the time being, simply assume that £ is small. Then, along with 
the obvious supplement of Eq. (3.8), $^,,i(m*) > inf^econv(n) ^p,h{'m), we 
have learned that the allowed values of the magnetization in the physical sys- 
tem nearly minimize the mean-field free energy. In this sense, the mean-field 
theory already provides a quantitatively accurate description of the physical 
system once £ <^ 1. In Sects. 3.3-3.4 we will use this fact to prove a first-order 
phase transitions in a few models of interest. 

To demonstrate the use of Theorem 3.2, let us consider the "evolution" of 
a typical MFT phase transition, in which two local minima of ^p^h exchange 
roles of the global minimizer as [3 varies. Specifically, let ms{P) and mx{P) 
be local minima of <l>^,h — one of which is always global — for (3 near some j3t, 
and suppose that ^/^^^("I'a) > ^i3,h{fns) for /3 > /3t and vice versa for /3 < (3t. 
Then Theorem 3.2 can be applied under the condition that, outside some small 
neighborhoods of ms{P) and mp^{(3) for /? T^i (3^, no magnetizations have a 
free energy within 8 of the absolute minimum. For (3 ^ (3x, this stipulation 
applies even to the neighborhood of ms{/3) and, for l3 ^ l3t, to the neighbor- 
hood of mA{/3). Then, Theorem 3.2 tells us that in the region P ^ /5t, the 
actual magnetization is near ms(/3), for P Pt^t could be near ms or ttia, 
and for /? ^ /?t it is only near mA(/3). On general grounds, as long as the dif- 
ference mA — ms is bounded uniformly away from zero, somewhere near Pi 
there has to be a point of phase coexistence. 

3.2. Mean-field philosophy 

In this section we will state some general facts about spin systems and their 
mean-field analogues. The stipulations that govern this section are rather mild; 
first we will assume that the Hamiltonian is of the form (1.1) with the Jx,y's 
satisfying the conditions of reflection positivity. Second, we will assume that 
the associated mean-field free-energy function defined in Eq. (2.8) has a unique 
minimizer. Finally, we will investigate the small- behavior of these models. 
The preferred viewpoint is a fixed dimension d with parameters /i — as defined 
in Eq. (1.2) — tending to zero or s — as defined in Eq. (1.3) — tending to d. 

We note that special cases (usually restricted to concrete models) have 
been addressed elsewhere; see, in particular, Ref. [34] and references therein, 
but there the only mechanism to force — > was the d — > oo limit which 
we find aesthetically somewhat unsatisfactory. Another possibility is to con- 
sider the aforementioned Kac limit which more or less boils down to infinite 
smearing out of the interaction. A contour-based analysis of this limit has been 
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carried out, but the technical aspects have so far been overcome only for very 
specific models [14-17, 36]. Here we provide a general result in this direction 
under the sole condition of reflection positivity. 

Theorem 3.3. (Mean-field philosophy) Consider the spin system 
as described above and let ^p^h be as in Eq. (2.8). Suppose that the pa- 
rameters /5 > and G M" are such that $/3,h has a unique minimizer m 
on Conv [Vt) in Eq. (2.8). Let (ji"J) be a sequence of coupling constants that 
are RP and obey Eq. (2.1), and let (— be a sequence of translation and 
rotation-invariant Gibbs states corresponding to these couplings. If the se- 
quence of integrals J'n, obtained from (Jx^y) via Eq. (3.9), satisfies 

J^n^O as n ^ oo, (3.10) 

then we have the following facts: 

(1) The actual magnetization tends to m, i.e., 

{So)fl m. (3.11) 

(2) The energy density tends to its mean-field value, i.e., 

{iSo,lmo + h))^;l^Eim), (3.12) 

where mo is as in Eq. (2.3) and E{m) is as in Sect. 2.1. 

In particular, in the limit n — > oo, the spin variables at distinct sites become 
independent with distribution given by the product of the titled measures 

e(^'^"^+'^)-«(^^+'^)/xo(dS'). (3.13) 

Here /Iq is the a priori measure. 

The preceding — as is the case in much of the principal results of this 
paper — reduces (the ^ limit of) the full problem to a detailed study of 
the associated mean-field theory. Two specific models will be analyzed in great 
detail shortly (see Sects. 3.3 and 3.4); let us mention two other well known (or 
well studied) examples. 

First are the 0{n) spin systems at zero external field. Here each takes 
values on the unit sphere in with a priori uniform measure. In the mean- 
field theory of these models, the scalar magnetization m(/3) vanishes for P less 
than some Pc while for /3 > /5c it is the maximal positive solution of a certain 
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transcendental equation (see, e.g., Ref. [34]). In particular, this solution rises 
continuously from zero according to 

|m(/5)| = (/3 - ^f^- [C{n) + 0(1)] , n ^- (3.14) 

By Theorem 3.3, the actual magnetization converges to this function but, un- 
fortunately, our control is not strong enough to rule out the possibility of small 
discontinuities (which vanish as — > 0). 

A less well known but very interesting example is the cubic model where 
the spins point to the center of a face on an r-dimensional unit hypercube, 
i.e., Sx E VL = {±ei, . . . , ±e,.}. For r > 3 the transition in this model is 
first order (and was analyzed in Ref. [11]). The case r = 2 reduces to an 
Ising system but the borderline case, r = 3, while still continuous, features 
a somewhat anomalous (namely, tricritical) behavior. Indeed, for this system, 
the mean-field magnetization obeys 

|m(/3)| = (/? - ^f'^ [C + 0(1)] , (3.15) 

where /9c = 3. Once again, the actual magnetization converges to such a func- 
tion but the control is not sufficient to rule out small discontinuities. 

While these sorts of results do not establish any critical behavior in par- 
ticular systems, they could represent a first step in proving that a variety of 
(mean-field) critical behaviors are possible. 

3.3. Results for the Potts model 

Our first result concerns the zero-field g'-state Potts model with q > ?>. 
Let F{(3,h) denote the free energy of the Potts model with the Hamiltonian 
in Eq. (2. 10) and let m*(/9) be the quantity 

--. (3.16) 

h=o q 

(An alternative definition of m*(/?) would be the limiting probability that the 
spin at the origin is "1" in the state generated by the boundary spins all set 
to "1.") Let tumf — rriMFif^) be related to the maximal positive solution 9 of 
Eq. (2.16) by uimf — Then we have: 

Theorem 3.4. Let q > 3 be fixed. For each e > there exists 5 > 
with the following property: For any d > 1 and any collection of coupling 
constants {Jx,y) on that are RP, obey (2.1) and for which the integral J' in 
Eq. (3.9) satisfies < 5, there exists a number (3i G (0, oo) such that 



(3.17) 
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holds and such that the physical magnetization rrii, — mi,{(3) of the correspond- 
ing q-state Potts model obeys the bounds 

m,{/3)<e for /3 < A (3.18) 

and 

\mM - rnuFm <e for P> j3,. (3.19) 

In particular, whenever the integral ^ is sufficiently small, i— > m*(/?) under- 
goes a jump near the value j3^. A similar jump occurs (at the same point) in 
the energy density. 

This statement extends Theorem 2.1 of Ref. [11] to a class of spread-out 
RP interactions. (A minor technical innovation is that the bound in Eq. (3.19) 
holds uniformly.) As a consequence, we are finally able to provide examples of 
interactions for which theq — 3 state Potts models in dimension d — 3 can be 
proved to have a first-order transition. Similar conclusion holds for all q > 3 
but, unfortunately, our requirements on the "smallness" of the corresponding 
parameters are not uniform in q. 

In d = 1, we show that the long-range Potts models with power-law de- 
caying interactions go first order once the exponent of the power-decay is be- 
tween one and two. Models in this category have been studied in Ref. [40] in 
the context of percolation; the domination techniques of, e.g., Ref. [3] then im- 
ply the existence of a low temperature phase. However, the percolation-based 
approach alone is unable to tell whether the transition is discontinuous or not. 
Some additional discussion is provided in Sect. 3.5. 

Our next item of interest will be the same system in an external field, as 
described by the full Hamiltonian (2.10). For reasons alluded to in Sect. 2.2, 
we will restrict our attention to the g > 4 cases. 

Theorem 3.5. Let q > 4: be fixed and let us consider the q-state Potts 
model with coupling constants Jx,y that are RP and obey Eq. (2.1). Then there 
exists 5o > and a function ho : (0, Sq] — > [0, h^), where is as in Eq. (2.18), 
such that if (3.9) obeys ^ < 5 with some 5 < 5o, then there exists a func- 
tion A: (— oo,/io) (0,oo) with the following properties: 

(1) A first-order transition ( accompanied by a discontinuity in the energy 
density and the magnetization) occurs at the parameters {h, Piih)), for 
any external field h e (— oo, /iq)- 

(2) Let m-i,{P, h) be the "spin-1 density" defined by the right partial deriva- 
tive -^F{(3, h). Then there exists an hi — hi{S) < such that 
h I— > myf(/3, h) has a discontinuity at field strength h such that (5 — (3t{h) 
provided that h e {hi, ho). 
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The function ho is decresing while hi is increasing. Moreover, lim^jo ^o(<^) — 
he andlimsio hi{5) — — oo. 

The second part of the theorem asserts that, even if state "1" is suppressed 
by the field, the order-disorder transition will be felt by the "spin-1 density" 
m*(/3, h). There is no doubt in our mind that the restriction to h > hi in this 
claim is only of technical nature. Our lack of control for h very large negative 
stems from the fact that the jump in the mean-field counterpart of m*(/3, h) 
decreases exponentially with \h\ash ^ — oo. Theorems 3.4 and 3.5 are proved 
in Sect. 6. 

3.4. Results for the Blume-Capel model 

The Blume-Capel model is a system whose spins ax take values in the set 
= {— 1,0,1} with a priori equal weights. The Hamiltonian is given most 
naturally in the form 

= /3 J] J.,,K - ^yf - A^K)^ -hY^c^x. (3.20) 

{x,y) X X 

As is easy to see, a temporary inclusion of the terms proportional to {a^y into 
the single- spin measure shows that this Hamiltonian is indeed of the general 
form in Eq. (1.1). 

If we consider the situation at zero temperature (/? = oo) with A and h 
finite we see that in the (A, /t)-plane there are three regions of constant spin 
which minimize l3M'{a). The regions all meet at the point /i = 0, A = 0; 
tentatively we will call the origin a triple point (and the lines phase boundaries). 
Ostensibly one would wish to establish that this entire picture persists at finite 
temperature. However, we will confine attention to the line /i = which is of 
the greatest interest. We will show, both in the context of mean-field theory 
and, subsequently, realistic systems that there is indeed a finite temperature 
first order transition at some At(/3). Of significance is the fact that this occurs 
at a At which is strictly positive; i.e., for 1 <^ /3 < oo, the point A = lies 
inside the phase which is dominated by zeros. 

We remark that results of this sort are far from new; indeed the proof 
of this and similar results represented one of the early triumphs of low tem- 
perature techniques Ref. [46]. The physical reason behind the shifting of the 
phase boundary is the enhanced ability of the "zero" phase over the plus and 
minus phases to harbor elementary excitations. Interestingly, in spite of the 
fact that our method relies on suppression of fluctuations, the corresponding 
entropic stabilization is nevertheless manifest in our derivation. In addition, 
while the contour-based approaches require a non-trivial amount of "low tem- 
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perature labor" to ensure that the interactions between excitations are Umited, 
our methods effortlessly incorporate whatever interactions may be present. 

To simplify our discussion, from now on we will focus on the situation 
at zero external field, i.e., h = 0, and suppress h from the notation. First let 
us take a look at the mean-field theory. Here we find it useful to express the 
relevant quantities in terms of mole fractions Xi,Xo, a;_i of the three spin states 
in Q. To within an irrelevant constant, the mean-field free-energy function is 

= 4/3a;ia;_i + /3xo(l - xo) + Axo + ^ x^logx^. (3.21) 

a=±l,0 

Here we have used the fact that Xi + Xo + x_i = 1. Our main result concerning 
the mean-field theory of the Blume-Capel model is now as follows: 

Theorem 3.6. For all (3 > and all A e R, all local minima of^p^x 
obey the equations 

xie^^=^-i = x_ie^^^i = a;oe^(^-'^°)+\ (3.22) 

Moreover, there exists a (3q < oo such that for all {3 > Po, any such (local) 
minimum is of the form that two components of {xi,Xq, x^i) are very near zero 
and the remaining one is near one. Explicitly, there exists a constant C < oo 
such that 

(1) If xq is the dominant index, then xi = X-i = |(1 — Xq) and we have 
that (l-xo) < Ce-^+\ 

(2) Ifxi is the dominant index, then x_i < Ce"*^'^ while Xq < Cq~^~^. A 
corresponding statement is true for the situation when x_i is dominant. 

Furthermore, consider two local minima at {(3, A), one dominated by Xq and 

the other dominated by Xi. Let (poil^, A) be the mean-field free energy corre- 
sponding to the former minimum and let 4>i{(3, A) be that corresponding to the 
latter minimum. Then 

MP, A) - MP, A) = A - e-'^^^ + 0(/3e-2/^) (3.23) 

where 0(/3e~^^) denotes a quantity bounded by a constant times [3e^^^ for 
all A in a neighborhood of the origin. In particular, for all (3 sufficiently large 
there exists \mf{P) — + 0(/3e~^^) such that the global minimizes of^p x 
have x±i <^ Ifor \ < Xmf{P) andxo <^ IforX > Xmf{P)- 

Theorem 3.6 is proved in Sect. 5.1. Next we will draw our basic conclu- 
sions about the actual system: 
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Theorem 3.7. Consider the Blume-Capel model in Eq. (3.20), with 

zero field (h = 0), inverse temperature (3 and the coupling constants {Jx,y) 
that are RP and obey Eq. (2.1). Let ,y he the integral in Eq. (3.9). There exist 
constants f3o G (0, oo) and C < oo such that if [5 > /?o and <^ e^^, then 
there is a function At: P2] ^ satisfying \Xt{P) — < such that 
any translation-invariant Gibbs state (— )/3,a obeys 

(1) {<yl)0,x < Cc-P ifX < \m 

(2) {al)p,x>l-Ct-l'ifX>\{P). 

Moreover, at \ = At(/3), there exist three distinct, translation-invariant Gibbs 
states (— )^,A' ^ ^ {+1)0)~1}' the typical configuration of which con- 
tains fraction at least 1 — Ce~^ of the corresponding spin state. 

We remark that the phase transition happens at a value of A which (at 
least for /3 » 1) is strictly positive. This demonstrates the phenomenon of 
entropic suppression (of ±1 ground states at A = 0) established previously 
in Ref. [46] by the contour-expansion techniques. The entropic nature of the 
above transition is also manifested by the fact that the free-energy "gap" sep- 
arating the distinct states decreases as /3 — > 00. This is the reason why, to 
maintain uniform level of control, we need to be smaller for smaller tem- 
peratures. Theorem 3.7 is proved in Sect. 6. 

3.5. Discussion 

We close this section with a discussion of some conjectures that can be ad- 
dressed via the above theorems. 

Starting with the intriguing results in Ref. [31] and culminating in 
Refs. [32, 33], A. Kerimov formulated the following conjecture (we quote ver- 
batim from the latter pair of references): "Any one-dimensional model with 
discrete (at most countable) spin space and with a unique ground state has a 
unique Gibbs state if the spin space of this model is finite or the potential of this 
model is translationally invariant." The conclusions of Theorem 3.4 manifestly 
demonstrate that this conjecture fails for the ID Potts model in external field. 
Indeed, for q > 3, h > and interactions decaying like with s e (1, 2) 
which are RP and satisfy the condition that the integral in Eq. (3.9) is suffi- 
ciently small, the Potts model has phase coexistence at some positive tempera- 
ture. However, it is clear that this system enjoys a unique ground state. 

In a recent paper [4], N. Berger considered random-cluster models with 
parameter q and interactions between sites x and y decaying as \x — y\~^, 
where d < s < 2d. He proved, among other results, that at the percola- 
tion threshold there is no infinite cluster in the measure generated by the free 
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boundary conditions. For ordinary percolation (i.e., q — 1), this implies con- 
tinuity of the infinite cluster density. As to the wired boundary conditions, 
for q = 2 — i.e., the Ising model — the classic results of Refs. [1,2] show that 
the magnetization vanishes continuously once the model is in the "mean-field 
regime" s G (1, %). However, for general random-cluster models with q > 1 
and wired boundary conditions, the situation remained open. 

While we cannot quite resolve the situation at the percolation threshold, 
our results prove that, for sufficiently spread out random-cluster models with 
RP couplings, there is a point where the free and wired densities are indeed 
different. To resolve the full conjecture from Ref. [4], one would need to es- 
tablish that the only place such a discontinuity can occur is at the percolation 
threshold. 

Our third application concerns the problem of partition function zeros of 
the Potts model in a complex external field with Re /i < 0. Here there have 
been numerical results [35] claiming that no such zeros occur for the nearest- 
neighbor 2D Potts model with g < 7. On the basis of the classic Lee- Yang 
theory [38, 49], absence of such zeros would imply analyticity of the spin-1 
density. The results of Refs. [5, 6, 8-10] rule this out for q very large and 
Theorem 3.5(2) also makes this impossible for reasonable values of q and suf- 
ficiently spread-out interactions (of course, for d = 1,2 this requires a power- 
law interaction). 

4. PROOFS: GENERAL THEORY 

The goal of this section is to prove Theorems 3.2 and 3.3. In Sect. 4.1 we 

present some general convexity results that provide the framework for the 
derivation of our results. However, the driving force of our proofs are the clas- 
sic tools of reflection positivity and infrared bounds which are reviewed (and 
further developed) in Sects. 4.2 and 4.3. The principal results of this section 
are Theorem 4.1 and Lemmas 4.2, 4.8 and 4.9. 

4.1. Convexity bounds 

We begin with an intermediate step to Theorem 3.2 which gives an estimate on 
how far above the mean-field free energy evaluated at a physical magnetization 
is from the absolute minimum. 

Theorem 4.1. Suppose {Jx,y) <^re translation and rotation invariant 
couplings on If' such that Eq. (2.1) holds. Let vp^h be a translation and 
rotation-invariant, infinite volume Gibbs measure corresponding to (3 > 
and h e R". Let {—)p,h denote the expectation with respect to vp^h (ind let 



Mean-field driven first-order phase transitions 



25 



= {So)p,h- Then 



^pArn^ < ^^^l^^^ ^pA^) + f {((-S^o, mo))^ ,^ - |m^|'}, (4.1) 



where tuq = Ea^ez^ Jo,xSx- 

Proof. The proof is very similar to that of Theorem 1.1 of Ref. [11]. 
Let A be a box of Lx ■ ■ ■ x L sites in Z'^ and let iVf a be the total spin in A, i.e., 
M\ = '}2xeA "^a;- Let us also recall the meaning of the mean-field quantities 
from (2.6-2.8). The starting point of our derivations is the formula 

glA|G(6) ^ ^ cib'^^)+^-^'^iSA\SA^)Zj,{SAc))^^^, b e M^ (4.2) 

which is obtained by invoking the DLR conditions for the Gibbs state u^^fi. 
Here J^(S'a|'S'a':) is as in Eq. (3.1) and Za(-Sac) is a shorthand for the parti- 
tion function in A given S^a^- 

The goal is to derive a lower bound on the right-hand side of Eq. (4.2). 
First we provide a lower bound on Za(5aO which is independent of bound- 
ary conditions. To this end, let (— )o,6 denote expectation with respect to the 
product measure 

^(6,M.)-|A|G(6)-Q^^(^5,^) (4.3) 

and let rrib denote the expectation of any spin in A with respect to this measure. 
Jensen's inequality then gives us 



Zx{S Ac) = e\MG(b) ^^-{b,MA)-0JifA(SA\SAc}^ 

> g|A|[G(6)-(6,m(,)] g-(/3jrA(SA|SAc)>o,6_ 



o,b (44) 



Now, (2.6-2.7) imply that G{b) — {b,mh) = S{mb), while the absolute 
summability of a; i— Jq^^ implies that for all e > there is a Ci < oo, de- 
pending on e, the Jx,y's and the diameter of Q, so that 

-{pjeA{SA\SAc))^^^> \A\E{mb) - (3e\A\ - (3Ci\dA\, (4.5) 

with E{mb) denoting the mean-field energy function from Sect. 2.1. (Note 
that we used also the normalization condition (2.1).) Invoking Eq. (2.8) and 
optimizing over all 6 e R", we thus get 

^a(5aO > e-l^l^^(^''^)-^^l^l-'^^^l^^l, (4.6) 



where Fmf{P, h) is the absolute minimum of $/3,/i(m) over all m e Conv (Q). 
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Having established the desired lower bound on the partition function, we 
now plug the result into Eq. (4.2) to get 

g|A|G(6) > ^g(b,MA)+/3^A(SA|SAc)\)^^e-|A|i^MF(/3A)-/3e|A|-/3Ci|aA|_ (^4 7^ 

The expectation can again be moved to the exponent using Jensen's inequality, 
now taken with respect to measure u/^^h- Invoking the translation and rotation 
invariance of this Gibbs state, bounds similar to Eq. (4.5) imply 

(/3J^(5a|5ac))^^^ 

>-|A|( J]^^o,.((^.,^o)>^,^ + (/i,mO-e') - C^ldA]. (4.8) 

Plugging this back into Eq. (4.7), taking logarithms, dividing by | A| and letting 
|A| — i> oo (with |(9A|/|A| — 0) followed by e | 0, we arrive at the bound 

G{h) - (6, m,) >-\Y. JoAi^x, So))^,^ - {h, - Fmf(/5, h). (4.9) 
Optimizing over b gives 

S{m,) - {h, m,)<^Yl '^o,a.((5., ^o))^^^ + Fmf(/?, h) (4.10) 

xezd- 

from which Eq. (4. 1) follows by subtracting 1 1 \ ^ on both sides. | 

Similar convexity estimates allow us to establish also the following 
bounds between the energy density and fluctuations of the weighted magne- 
tization mo: 

Lemma 4.2. Let k = supg^Q^S, S) and let {Jx.y) be a collection of 
couplings satisfying Eq. (2.1). For each /? > and h G M" there exists a 
number x = h) such that for any translation and rotation invariant Gibbs 
state {—)i3,h we have 



where mo = ^^.^z^ Jo,x andm^ = {So)p,h- 



I //3,h.' 

(4.11) 



Proof. We begin with a rewrite of the correlation function in the middle 
of Eq. (4.1 1). First, using the DLR equations to condition on the spins in the 
complement of the origin, we have 

((mo, So))^^^ = ((mo, VG'(/3mo + h))^^^. (4.12) 
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Next, our hypotheses imply that = {friQip^h — (VG(/3mo + h))^^^, and 
so 

((mo, VG{l3nno + h)) - |m^p 

= ((mo - m*, VG'(/3mo + h) - VG{(5m^ + h))) (4.13) 

For the rest of this proof, let E abbreviate the inner product in the expectation 
on the right-hand side. 

We will express S using the mean value theorem 

S = (mo - m^, [VVG'(6)](mo - m^), (4.14) 

where 6 is a point somewhere on the line between /3mo + h and /3m* + h. 
The double gradient WG{h) is a matrix with components (VVG'(6))ij = 
(S'J^5j^'^)o,6 - {Sf)Q,b{Sl^^)o,b- As was shown in Ref. [11], the ^^.operator 
norm of VVG(6) is bounded hy k = su-pseni^i ^) ^^^^ so we have 

S < f3K |mo - m*|^. (4.15) 

Taking expectations on both sides, and invoking Eqs. (4.12^.13), this proves 
the upper bound in Eq. (4.1 1). 

To get the lower bound we note that, fxo almost surely, the double gradi- 
ent VVG(6) is positive definite on the linear subspace generated by vectors 
from n. (We are using that ^2 is the support of the a priori measure yUo.) Since 
(3mo + h takes values in a compact subset of this subspace, we have 

> /3x|mo - m*|^ (4.16) 

for some (existential) constant x > 0. Taking expectations, the left inequality 
in (4.11) follows. I 

We emphasize that in its present form, the bounds (4. 1) and (4.11) are es- 
sentially of complete generality. Underlying most of the derivations in this pa- 
per is the observation that the variance term on the right-hand side of Eq. (4. 1 1) 
is sufficiently small. Via Eq. (4.1), the physical magnetization m* is then 
forced to be near one of the near minima of the mean-field free energy. This 
reduces the problem of proving discontinuous phase transitions to: 

(1) controlling the variance term in Eq. (4.1 1), 

(2) a detailed analysis of the minimizers of ^g f^. 

For (1), we will use the method of reflection positivity/infrared bounds dis- 
cussed in the following subsections. As mentioned before, this does impose 
some restrictions on our interactions and our Gibbs states. Part (2) is model 
specific and, for the Potts and Blume-Capel models, is the subject of Sect. 5. 
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4.2. Reflection positivity 

Our use of reflection positivity (RP) will require that we temporarily restrict 
our model to the torus Tl of Lx ■ ■ ■ x L sites. In order to define the interaction 
potential on this torus, we recall that the Jx,y's are translation invariant and 
define their "periodized" version by 

45 = E '^-.^+^- (4.17) 

where Lz is the site whose coordinates are L-multiples of those of z. The torus 
version of the Hamiltonian (1.1) is then defined by 

PJ^l{S) = - Y1 /^^iH-^., Sy) - J2 (S., h). (4.18) 

{x,y) x^Tl 
x,yeTL 

(Here, as in Eq. (1.1), the first sum is over all unordered pairs of sites.) 
Let Pi denote the Gibbs measure on whose Radon-Nikodym derivative 
with respect to the a priori spin distribution /io(dS') is the properly normal- 
ized Q-l^-^LiS)^ 

Let us suppose that L is even and let us temporarily regard T^, as a pe- 
riodized box {1, . . . , L}'^. Let be those sites whose i-th coordinate ranges 
between 1 and L/2 and let be the remaining sites. The two parts of the torus 
are related to each other by a reflection in the "hyperplane" P that separates 
the two halves from each other. (The geometrical image of the plane has two 
components.) Given such a plane P, we let denote the cr-algebra of events 
that depend on the configuration in Tj, and similarly for and T£. 

Let 'dp denote the reflection taking onto and vice versa (cf. the 
definition of 'd^''^ in Sect. 3.1). In the natural way, i^p induces an operator 'dp 
on the set of real- valued functions on (^2^^). Then we have: 

Definition 4.3. (RP on torus) We say that Pl is reflection positive 
if for every plane P as described above and any two bounded, -measurable 
random variables X and Y, 

El{Xi»*p{Y))^El{Yi»*p{X)) (4.19) 

and 

^^{xrpiX)) > 0. (4.20) 
Here is the expectation with respect to P^. 

Condition (4.20) in the above definition is often too complicated to be 
verified directly. Instead we verify a convenient sufficient condition which we 
will state next: 
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Lemma 4.4. Consider a collection of coupling constants {Jx,y)x,y£Z'i 

satisfying the properties of Definition 3.1 in Sect. 3.1. Then the measure P^, 
defined onTi using the periodized coupling constants from Eq. (4.17), is re- 
flection positive in the sense of Definition 4.3. 

Proof. This is a multidimensional version of Proposition 3.4 of [22]. | 

Remark 4.5. We note that the three classes of interactions listed in 
Sect. 1.2 are reflection positive. For the most part, interactions of this sort 
were discussed in Ref. [22]; however, for reader's convenience, we provide the 
relevant calculations below. 

(1) Nearest-neighbor/next-nearest neighbor couplings: Consider a func- 
tion / : Hi — > C which is nonzero only on the sites of Hi that are adjacent 

to Z'^ \ Hi. (By inspection of Eq. (3.6), for nearest and next-nearest neigh- 
bor interactions, this is the most general function that need to be considered.) 
Pick ?7 G M and consider the function 

gjix) = fix) + rifix + e,), j = 2, . . . , d, (4.21) 

and define a collection of coupling constants {Jx,y) by the formula 

J2 Jx,yW)f{^^'h)- E Y.^)9A^) (4.22) 

iieHi j=2,...,d seHi 

y&'^-^Wi 

Now the right-hand side is clearly positive and so the Jx,y^ satisfy the condition 
in Eq. (3.6). 

It remains to identify the explicit form of these coupling constants. 
Let X e Hi be a boundary site and let x' — ■d^^'^x be its nearest neighbor 
in Z*^ \ Hi. First we note that, for each x and j, there is an interaction of 
"strength" rj between x and its next-nearest neighbor x' + e^ and a similar 
interaction between x and the site x' — Cj. So, the next-nearest neighbors 
have coupling strength r]. As to the nearest-neighbor terms, for a fixed x and 
fixed j, there is the direct interaction with x' of strength 1 and there is a term 
of strength 77^. Thus, upon summing, the nearest-neighbor interaction has total 
strength (d - + r]"^). 

Since the overall strength of the interaction is irrelevant, the ratio of the 
strength of the next-nearest neighbor to the nearest-neighbor couplings has to 
be a number of the form ^ which, in particular, permits any ratio whose 

absolute value is bounded by 2(d-i) ■ 

(2) Yukawa potentials: Reflection positivity for the Yukawa potentials can be 
shown by applying the criterion from Lemma 4.4: Fix // > and let Jx,y — 
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e-MlN-2/lli. Then for any observable /: Hi R, 

^ J.,,/(x)/(^«y) 

where the operator kernel K: Z*^"^ — > U^~^ is defined by K{x,y) — 
exp{— /i^^^2 ~ Vjl}- This operator is symmetric and diagonal in the 
Fourier basis; a direct calculation shows that K has only positive eigenval- 
ues. This means that the right-hand side is non-negative, proving condition (3) 
of Definition 3.1. (The other conditions are readily checked as well.) 

(3) Power-laws : We begin by noting that all conditions on Jx,y in Definition 3.1 
are linear in Jx,y. Therefore, any linear combination of reflection positive Jx,y's 
with non-negative coefficients is also reflection positive. In particular, if we 
integrate a one parameter family of interactions against a positive measure, the 
result must also be RP. Now if we let 

poo 

Jx,y = / /x'~^e-''l^-^lid/x for s > 0, (4.24) 
Jo 

then Jj.^y — C{s)\x — y\^^ and so the power laws are RP as well. 

We observe that in the classics, particularly, Refs. [22,23], the above types 
of interactions are treated and the RP properties established with all distances 
expressed in ^2-norms. The derivations therein all rely, to some extent, on lat- 
ticization of the field-theoretic counterparts to reflection positivity which were, 
perhaps, better known in their heyday. Our ii derivations, while being a more 
pedestrian method of extension from d — 1, have the advantage that they are 
self-contained. 

4.3. Infrared bounds 

Our principal reason for introducing reflection positivity is to establish an upper 
bound on the two point correlation term in Theorem 4. 1 . This will be achieved 
by invoking the connection between reflection positivity and infrared bounds. 
For spin systems this connection goes back to Ref. [24] where infrared bounds 
were used to provide proofs of phase coexistence in certain continuous-spin 
models at low temperature. Here we will follow the strategy of Ref. [11], and 
so we will keep our discussion brief. 

In order to apply infrared bounds to the problem at hand we must first 
restrict consideration to those Gibbs states with the following two properties: 
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PropGrty 1 . (Torus state) An infinite volume Gibbs measure up^h is 
called a torus state if it can be obtained as a weak limit of finite-volume states 
with periodic boundary conditions. (The torus states need not correspond ex- 
actly to the values j3 and h.) 

Property 2. (Block averages) An infinite volume Gibbs mea- 
sure vp^h is said to have block average magnetization if 

lim — -j- = m*, vr h-almost surely. (4.25) 

Similarly, the measure is said to have block average energy density if 

lim ^ (5^, 5j,) = e^, up^h-almost surely. (4.26) 

{x,y} 

Here in Eqs. (4.25-4.26) the limits are along increasing sequences of square 
boxes centered at the origin. 

It is conceivable that not every (extremal) Gibbs state will obey these re- 
strictions, so the reader might wonder how we are going to detect the desired 
phase transitions. We will use an approximation argument which goes back to 
Ref. [11]. Recall the definition of the set h) of "extremal magnetiza- 

tions" from the paragraph before Theorem 3.2. Then we have: 

Lemma 4.6. For all P > 0, h E and all G h), there ex- 

ists an infinite volume Gibbs state I'p^hfor interaction (1.1) which obeys Prop- 
erties 1 and 2. 



Proof. This is, more or less. Corollary 3.4 from Ref. [11] enhanced to 
include the block average energy density. | 

Our next goal is to show that the right-hand side of Eq. (4.1) can be 
controlled for any Gibbs state satisfying Properties 1 and 2. To this end let 
D~^{x,y) denote the inverse of the (weighted) Dirichlet lattice Laplacian de- 
fined using the J^.^'s. Explicitly, we have 

/At, „ik-ix-y) 
' (4.27) 
,-7r,7r]<i (27r)M - J(k) 

where J{k) — Ylix&d. Jq,x^^^'^- We will always work under the conditions for 
which the integral is convergent. Our principal estimate is now as follows: 

Lemma 4.7. (Infrared bound) Assume that k ^ {l - J{k))~'^ is 

Riemann integrable. Fix /3 > 0, h. e R" and let vp^h be an infinite-volume 
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Gibbs measure for interaction (1.1) that satisfies Properties 1 and 2. Let {—)i3^h 
denote the expectation with respect to vp^h and let n be the dimension of the 
underlying spin space. Then the bound 

^ v,VyD-\x,y) (4.28) 

holds for all v: ^ 'C such that X^xez^ l"^^! ^ 

Proof. As this lemma and its proof are similar to Lemma 3.2 of Ref. [11] 
we will stay very brief. Let Jx^y denote the periodized interactions correspond- 
ing to the torus and let 

n={ i^n^. Y'^d) : 1 < < l} (4.29) 

be the reciprocal torus. It is easy to see that the A;-th Fourier component J^^^ {k) 
of the Jij's satisfies j(^)(A;) = J{k) for all k e T*. This means that the in- 
verse Dirichlet Laplacian on T^, can be written in terms of the original coupling 
constants, i.e., 

DT\x,y)^—— V . — . (4.30) 

L \ ^yj T* ^ 1 _ j(k) 

The infrared bound of Ref. [22] then says that, for any Gibbs state (— )^^^ on T^, 
we have 

{{Wx,Sx){Wy,Sy)f^l^<^ Yl i'^x,Wy)Dl\x,y) (4.31) 

for any absolutely summable collection of complex vectors (tOij^gT^ 
with Re lUj:, Im lUj; e R" and Ylixer^ ~ ^• 

Now let us consider a torus state i/^^h with almost-surely constant block 
magnetization. We will first prove that v^^h satisfies the L ^ oo version of 
Eq. (4.31). By the assumption on the Riemann integrability of ^Jj^^y 

Dl\x,y) D-\x,y), (4.32) 

L >oo 

independently of x, y. Letting all Wx be parallel, i.e., Wx — WxQ, where e is a 
unit vector in R", and passing to the limit L — > oo, we thus get 

Y WxWy{{Sx,Sy))^^^<^ Y WxWyD-\x,y) (4.33) 

whenever w: Z'^ ^ C is absolutely summable and J2xgZ'^ '^^ ~ ^• 

In order to make the m^'s appear explicitly on the left-hand side, we need 
to relax the condition on the total sum of the Wx's. Under the condition in 
Property 2, this is done exactly as in Lemma 3.2 of Ref. [11]. | 



Mean-field driven first-order phase transitions 



33 



4.4. Actual proofs 

A key consequence of the infrared bound is the following estimate on the vari- 
ance of the quantity mo = Ylix&j^ 

Lemma 4.8. (Variance bound) Consider a collection {J^^y) of 
coupling constants that are RP and obey Eq. (2.1), and let be the inte- 
gral in Eq. (3.9). Let {—)/3^h be a translation and rotation invariant Gibbs state 
satisfying Properties 1 and 2 and let = {So)/3,h- Then 



Proof. We have to show how the bound (4.28) is used to estimate the 
variance of mo. Let {vx) be defined by — Jq,x- Using Lemma 4.7 and 
Lemma 4.6, for any {—)i3,h as above, this choice of the f ^.'s leads to the variance 
of mo on the left-hand side of Eq. (4.28), while on the right-hand side the sum 
turns into the integral J^. | 

The proof of Theorem 3.2 is now reduced to two lines: 

Proof of Ttieorem 3.2. Combining Lemmas 4.6 and 4.8 with 
Eqs. (4.11) and (4.1), we obtain Eqs. (3.8-3.9). | 

Armed with the conclusions of Theorem 3.2, we can now finish also the 
proof of Theorem 3.3: 

Proof of Theorem 3.3. In light of the previous derivations, the claims 
in Theorem 3.3 are hardly surprising. The difficulty to be overcome is the 
fact that the limits in Eqs. (3.11-3.12) are claimed for sequences of any states, 
regardless of whether they obey Properties 1 and 2 above. 

We begin with the proof of part (1); namely, Eq. (3.1 1). Since m is the 
unique minimizer of $/3,h„ for each e > there exists 5 > such that 



is contained in a ball Ueirn) of radius e centered at m. By Eq. (3.8), 
once (5n^^ < 5, all of ^^{(3,h) must be contained in this ball. 
But, h) is the set of extremal magnetizations, and any magnetization m' 

that can be achieved in a translation-invariant state is thus in the convex hull 
of .>#*(/3, h). It follows that m' e U^{m), proving Eq. (3.1 1). 

ToproveEq. (3.12), let [e^.m^] be an extremal pair in /i). (Seethe 

discussion prior to Theorem 3.2 for the definition of these objects.) Let (— )/3,a 
be a translation and rotation invariant state for which 



(3{\mo-m,\')^^^<ny. 



(4.34) 



{m' e Conv {Q) : <^^,h{m') < Fmf{P, h) + S] 



(4.35) 



= {{So, fmo + h))^^^ and m* = {So)p,h 



(4.36) 
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and suppose the state satisfies Properties 1 and 2. (The existence of such a state 
is guaranteed by Lemma 4.6.) Combining Eqs. (4.11) and (4.34), we get 

< {{So, mo))^^^ - |m^|2 < kuJ^, (4.37) 

and so, invoking the result of part (1) of this theorem, is close to E{m^) 
once is sufficiently small. But this is true for all extremal pairs in h) 
and so it must be true for all pairs in h). Hence, h) shrinks to a 

single point as ^ | 0, which is what is claimed in part (2) of the theorem. 

To conclude the proof of the theorem, we need to show that the spin 
configuration converges in distribution to a product measure. Applying the 
DLR conditions, the conditional distribution of S'o given a spin configuration 
in Z'^ \ {0} is 

i.e., the distribution of Sq depends on the rest of the spin configuration only 
via mo = X^xeZ'* Jo.xSx- Hence, it clearly suffices to show that mo converges 
to m — the unique minimizer of $^ ft, — in probability. But this is a direct con- 
sequence of the convexity bound on the left-hand side of Eq. (4. 1 1) which tells 
us that, once the magnetization and energy density converge to their mean-field 
values, the variance of mo tends to zero. | 

While we cannot generally prove that, in systems with interaction (1.1) 
the magnetization increases with (3, the estimates in the previous proof provide 
a bound on how bad the non-monotonicity can be: 

Lemma 4.9. (Near monotonicity of magnetization) Let { Jx,y) 

be coupling constants that are RP and obey Eq. (2.1), and let be the in- 
tegral in Eq. (3.9). Let {3 < 13' and let m^, G h) and m^ e h). 
Then we have: 

|m*r < I'^^r + 1^^-^- (4-39) 

Proof. Let {—)j3,h and {—)i3',h be (translation and rotation invariant) 
states satisfying Properties 1 and 2 in which the above magnetizations are 
achieved. (Such states exist by Lemma 4.6.) By Eq. (4.1 1) we have 

((5o,mo)>^^^> |m,|2, (4.40) 

and Eqs. (4.1 1) and (4.37) yield 

((5o, mo)>^, < \m'^\'' + nnJ'. (4.41) 

But the quantities on the left are, more or less, derivatives of the physical free 
energy with respect to (3 (in the parametrization introduce in Eq. (1.1)). Hence, 
standard convexity arguments give us 

(('S'o,mo)>^,^>((5o,mo)>^,^. (4.42) 
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Combining these inequalities the claim follows. | 



4.5. Bounds for specific interactions 

Having presented the main theorem, we now argue that by appropriately ad- 
justing the parameters // and s in the Yukawa and power law terms of an in- 
teraction, one can make the integral as small as desired. We begin with a 
general criterion along these lines: 

Proposition 4.1 0. Let ) be a family of translation and reflection- 
invariant couplings depending on a parameter X. Assume that the J^y obey 
Eq. (2.1) andlet J\{k) = '^^xei'^ Jqx^^^^ the Fourier components. Suppose 
that the following two conditions are true: 

(1) There exists a S > and a constant C > such that for all sufficiently 
small A, we have 

i^j^>C, fce[-7r,7r]'^\{0}. (4.43) 

(2) The i^-norm of{j'^l) tends to zero a* A — > 0, i.e., 

&oE[^S]' = 0- (4.44) 



Then we have: 

*^^ = o. (4.45) 

Proof. Note that, by Eq. (2.1) and condition (1) above we have Ja(0) = 1 
and J\{k) < 1 for all k Q. (The reflection invariance guarantees that Jx is 
an even and real function of A;.) First we will bound the part of the integral 
corresponding to A; 0. To that end we pick r > and estimate 

I'^'^'N/ (4.46) 



fel<, {2ttY 1 - j^(k) ~ J\k\<r i'^^y C\k\ 

where Ci = Ci{6, d, C) < oo. Next we will attend to the rest of the integral. 
Let M(r) be the supremum of (1 — Ja(/i:))^^ overall /c G [— tt, tt]'^ with |A;| > r. 
By condition (1) above, we have that M(r) < ^r^~'^. Therefore, 

r « ^ Y\ [tl] ^ (4.47) 
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where we also used Parseval's identity. By condition (2) above, this vanishes 
as A — > 0, while the integral in (4.46) can be made as small as desired by 
letting r I 0. From here the claim follows. | 

Now we apply the above lemma to our specific interactions. We begin 
with the Yukawa potentials: 

Lemma 4.11. Let (Ji^y) be the Yukawa interactions with parame- 
ter n — as described in Sect. 1.2 — and suppose these are adjusted so that 
Eq. (2.1) holds. Then (Jx^y) obey conditions (1) and (2) of Proposition 4.10 
as 11 I Q with 5 = d — 2. Consequently, in dimensions d > 3, the correspond- 
ing integral in Eq. (3.9) tends to zero as J, 0. 

Proof. Let (Jx^y) be as above and let denote the Fourier transform. In 
order to handle the overall normalization effectively, we introduce the quantity 
by C^/i'^ '^xjto e^^'^'^ = 1 and note that converges to a finite and pos- 
itive limit as 1 0. From here we check that the ^^-norm in Eq. (4.44) scales 
as n'^ and so condition (2) of Proposition 4.10 follows. 

It remains to prove that 1 — Jij,{k) is bounded from below by a positive 
constant times | fcp, where | A;| denotes the £^-norm of k. First we claim that for 
all > there exists a constant A < oo such that for all k e [— tt, tt]*^, 

Mk) <1-V, \k\>Afx. (4.48) 

Indeed, an explicit calculation gives us 

J,{k) = /C,^e-^l-l^+^^- < /C,n{Re ^ _ }, (4.49) 

x^O j=l 

where we first neglected the condition x ^ 0, then wrote the result as the 
product over lattice directions and, finally, threw away some negative constants 
from each term in the product (the real parts are positive). Introducing the 
abbreviations a — e"'*, e — 1 — a and = 1 — cos{kj), the e-multiple of 
the j-th term in the product is now 

1 + aA,e 

e Re = — TT^ ■ (4.50) 

Now if > Aj the right-hand side is less than 1 + ae, while if < A^, then it 
is less than e + which is -C 1 once <C Aj. Going back to Eq. (4.49), if 
at least one component of k exceeds large constant times // (which is itself of 
order e), then the right-hand side of Eq. (4.49) is small. This proves Eq. (4.48) 
for ijl small; for all other // this holds existentially. 
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The condition (4.48) implies Eq. (4.44) for |A;| > A/j,. As for the comple- 
mentary values of k, here we pick a small number 9 and write 

1- J^{k)>C^H'^ J2 e-'^l^liil -cos(A;-x)]. (4.51) 
\x\i<e/n 

By the fact that \k\ < A^, the condition |a;|i < 9/ ji (with 9 sufficiently small) 
implies that 1 — cos(/c • x) > c{k ■ x)'^ for some c > 0. Plugging this into 
Eq. (4.51) and using that the domain of the sum is invariant under reflection 
of any component of x, the result will be proportional to The constant of 
proportionality is of order and so condition (1) is finally proved. | 

Next we attend to the power laws: 

Lemma 4.12. Let (Jx^l) be the power-law interactions with expo- 
nent s > d — see Sect. 1.2 — and suppose these are adjusted so that Eq. (2.1) 
holds. Then obey conditions (1) and (2) of Proposition 4.10 as s [ d 

with any 6 < d. Consequently, the corresponding integral in Eq. (3.9) tends to 
zero as s I d in all d> 1. 

Proof. Our first item of business will again be the overall normalization. 
Let Cg be the constant defined by 

C,(s - d) ^ l^lr' = 1. (4.52) 

As is not hard to check, Cs tends to a positive and finite limit as s I d. 
Since J2xj^o l^li"^^ uniformly bounded for all s > d,, the £^-norm in 
Eq. (4.44) is proportional to {s — d). This proves condition (2) of Proposi- 
tion 4.10. 

In order to prove condition (1), we first write 

1 - j^{k) = Cs{s -d)J2 l^lr'(l - cos{k ■ x)), (4.53) 

x^O 

where is the Fourier transform of the {Jx,l). Consider the set TZk — {x e 
Z'^: cos(A; • x) < 0}, which we note is the union of strips of width — and 
separation — of the order 0{l/\k\) which are perpendicular to vector k. A sim- 
ple bound gives us 

klr'(l - cos(A; • a;)) > ^ \x\^\ (4.54) 

XyiO X&TZk 
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Next we let TZ'^ — {x & U^: \x ■ k\ > tt}. The fact that \x\i^ decreases with 
distance allows us to bound the second sum in Eq. (4.54) by a similar sum 
with X E TZ'j^. Using the usual ways to bound sums by integrals, we thus get 

f dr 

1 - Js{k) > C{s -d) (4.55) 

^|fe-a;|>7r Fl 

where C is a positive constant (independent of ,s) and \x\ is the £^-norm of x. 
Extracting a factor of \kY~'^, the resulting integral times (s — d) is uniformly 
positive for all s > d. Hence we proved that for some d > 0, 

1 - Js{k) >c'\k\'-'^ (4.56) 

for all s > d and all k e [— tt, t^Y, and so condition (1) of Proposition 4.10 
holds as stated. | 

5. PROOFS: MEAN-FIELD THEORIES 
5.1. Blume-Capel model 

We begin by giving the proof of Theorem 3.6 which deals with the mean-field 
theory of the Blume-Capel model. The core of this proof, and other proofs 
in this paper, are certain facts about the mean-field theory of the Ising model 
in an external field. In the formalism of Sect. 2.2, this model corresponds to 
the q = 2 Potts model. The magnetizations are parameterized by a pair of 
quantities {zi, Z-i), where zi + Z-i = 1, which represent the mole-fractions of 
plus and minus spins. The mean-field free energy is given by 

^j,h = Jziz^i - hzi + ^ilogzi + 2;_ilog2;_i. (5.1) 

The following properties are the results of straightforward calculations: 

(11) If /i = and J < 2, then the only local — and global — minimum occurs 

at zi = z_i. 

(12) lfh = and J > 2, then there is only one local minimum with zi > z_i 
and it satisfies Jzi > 1 > Jz_i. A corresponding local minimum with 
with zi > z_i exists and obeys Jz^i > 1 > Jzi. 

(13) Let now h be arbitrary. If [zi, 2; i) is a local minimimum of 
then m — zi — z^i satisfies J(l — m^) < 1. 

These properties are standard; for some justification see, e.g., the proof of 
Lemma 4.4 in Ref. [11]. 
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Proof of Theorem 3.6. Let {xi, xo,x_i) be a triplet of positive vari- 
ables which corresponds to a local minimum of the Blume-Capel free-energy 
function $3, a from Eq. (3.21). A simple calculations shows that the deriva- 
tive of the entropy part of $^3 a is singular in the limit when any component 
of (xi, Xo, a;_i) tends to zero, while nothing spectacular happens to the energy. 
Therefore, the minimum must lie strictly inside the simplex of allowed values. 
Accounting for the constraint a;i + a;o + = 1, the condition that the gradient 
of x vanish at (xi, xq, ,t_i) translates into the equations (3.22). 

Due to the symmetry between xi and we may (and will) assume 
for simplicity that Xi > a;_i. First we claim that, under this condition, we 
have 4/3x_i < 1. Indeed, for a fixed xq, the Blume-Capel mean-field free 
energy $/3^a expressed in terms of {zi,Z-i), where z±i = x±i/{l — xq), is 
proportional to the Ising free energy (5.1) with J = 4/3(1 — xq). Since the 
Ising pair {zi, z^i) is at its local minimum, we have Jz-i = Aj3x-i < 1 by 
property (12) above. 

Once we know that x_i is small, the question is whether xq and xi divide 
the amount 1 — democratically or autocratically. Here we observe that, 
once again, for a fixed x_i, the (xi, Xo)-portion of the Blume-Capel mean- 
field free energy $^ a is proportional to its Ising counterpart in Eq. (5.1) with 
J = /?(! — a;_i) and h = 3/3a;_i — A. In light of property (13) above, the 
magnetization variable m — {x-i — xq)/{1 — a;_i) thus satisfies the bound 
J(l — m^) < 1. Using the inequality \/l — a >1 — a valid for all a < 1, we 
have 

^f^>l-jr^, (5.2) 
once /3 is sufficiently large. Some simple algebra now shows that this implies 

2/3min{a;i, Xo} < 1. (5.3) 

Using these findings in Eq. (3.22) and extracting appropriate inequalities we 
derive the bounds listed in (1) and (2) with C being a numerical constant. 

To derive the asymptotics (3.23) on the free-energy gap for A 0, let us 
first evaluate the free energy at a generic local minimum. Suppose (xi, xo, x_i) 
obey Eq. (3.22) and let O denote the logarithm of the quantity in Eq. (3.22). A 
direct calculation shows that then 

^f5,h = -4/3xix_i + pxl + e. (5.4) 

Now let us consider a minimum with xo dominant. Then the inequality — 
Xq) = Pi^i + X-i) ^ '^/4 < 1 shows that the (xi, x_i) Ising pair is subcritical. 
By (II) above we must have xi = x_i = |(1 — xo) and, as is seen by a direct 
calculation, xq can be determined from the equation 

= 2e-'^+\ (5.5) 

Xo 
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In particular, for A bounded we have 1 — xq — 2e~^+^ + 0{e~'^^). Similarly, 
if the minimum corresponds to a triple dominated by Xi, our bounds show 
that xq — 1 — xi + 0(e~^^) and so we have 

= (1 - + 0(e-4^))e^+^+o(^«"'). (5.6) 

From here we have 1 — xi = + 0(/3e~^^). 

Now we are ready to derive Eq. (3.23). First, using that O — logxo + 
P{1 — 2xo) + A we have 

(f)o{p, A) = -4/?xix_i + (3{1 - xoY + A + logxo 

= A-2e-^+^ + 0(e-2'3). (5.7) 

Next, in light of O — logxi + 4/3x_i and the bounds proved on x_i in (2) 
above we have 

0i(/?, A) = -APxix^i + 13x1 + ^ogxi + 4/3x_i 

= -e-^-^ + 0(/3e-2^). (5.8) 

Combining Eqs. (5.7-5.8), the desired relation (3.23) is proved. | 

We finish this section with a computational lemma that will be useful in 
the proof of Theorem 3.7: 

Lemma 5.1 . There exists a > and, for each C ^ 1, there exists 
< oo such that the following is true for all (5 > (5q and all A with |A| < 
Cq~I^: If {xi , Xq, x_i) is a triplet with 

max{xi, Xq, X-i} = 1 — Cq~^ , (5.9) 

then 

^p,x{^i,x^,x^^)-\ni^p^^ > a(ClogC)e-^. (5.10) 
Here ^i3,\ is the function in Eq. (3.21) and inf ;^ is its absolute minimum. 

Proof. An inspection of Eqs. (5.7-5.8) shows that, once |A| < Cq~^, we 
have that | inf $/3,a| is proportional to Ce^^ and so we just have to prove that, 
once C is sufficiently large, <^i3^\{xi,xq, X-i) is proportional to (C log C)e~^. 
We will focus on the situation when the maximum in Eq. (5.10) is achieved 
by xi, the other cases are handled similarly. 

By our assumption we have that xq and x i are quantities less than Cq~^. 
Inspecting the various terms in Eq. (3.21), we thus have 

/3xo(l - xo) = /3xo + 0{(5C\-^^), 

= 4/3x_i + 0(/3C^e-^^), (5.11) 
xi logxi = -Ce-^ + ©(C^e-^^), 
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Plugging these back into the definition of a we get 

^p,x{xi,Xo,x^i) = Xo[/3 + logXo] +a;_i[4/3 + logx_i] 

+ Xxo-Ce-^ + 0{f3Ch-^^). (5.12) 

Now |Axo| < |A| < Ce"'^, and if Po is such that /?Ce"^ < 1, the last three 
terms on the right-hand side are all of order Cq~^. It thus suffices to to prove 
that the first two terms exceed a constant times (C log C)q~^. 

We first replace 4/? by B in Eq. (5.12) and then substitute xq = ijqQ'^ 
and X-i = |/_ie^*^. The relevant two terms on the right-hand side then equal 
e~^[l/o log Ho + y-i log y-i]. Under the condition (5.9) — which implies that at 
least one of the y's is larger than — this is a number of order e^^'^ClogC 
(for C >> 1). The right-hand side of Eq. (5.12) is thus of order e^^'^ClogC 
whenever (3 > (3o, which proves the desired claim. | 

5.2. Potts model: Preliminaries 

Next we turn our attention to the mean-field theory of the Potts model. In 
the present section we will first establish some basic properties of the (local) 
minimizers of the Potts mean-field free energy. The proof of Theorem 2.3 
dealing with positive fields is then the subject of Sect. 5.3. The negative-field 
portion of our results (Theorem 2.4) is somewhat more involved and we defer 
its discussion to Sect. 5.4. 

We invite the reader to recall the representation of magnetizations in 
terms of barycentric coordinates in Eq. (2.14), the mean-field free-energy func- 
tion ^p^h from Eq. (2. 15) and the transitional coupling for the g- state Potts 
model from Eq. (2.17). We begin with some general monotonicity properties 
of the minimizers: 

Lemma 5.2. (Monotonicity In h) For any p>0 we have: 

(1) Let h < h', let xi be the first barycentric coordinate of a global min- 
imum of \ and let x[ be the first barycentric coordinate of a global 
minimum of^'^^\i. Then xi < x'-^. 

(2) Let {xi, . . . ,Xq) be the probability vector corresponding to a global min- 
imizer of(^^^\. Ifh>0 then Xi > max{x2, . . . ,Xq}. Similarly, ifh<0 
then xi < min{a;2, • • • , Xq}. 

(3) If h ^ m{(5 ,h) is a differentiable trajectory of local extrema, then 



^^^^l{m{P,h))^-x,{(3,h), 



(5.13) 
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where xi{(3, h) is the first component ofm{(3, h) in the decomposition 
into (vi, . . . , Vg). 

Proof. (1) Let m e Conv Vt. Then we have 

- ^%{rn) = {h' - h)x,, (5.14) 

where xi is the first component of m. Let xi and x[ be as above and let m 
and m' be the corresponding minimizers. Then Eq. (5.14) implies 

4''i(m)-4t(m') 
< ^'^^ , — ^ — - (5.15) 
h— h 

Similar reasoning gives 

Combining Eqs. (5.15) and (5.16) gives the result. 

(2) Let h > and let (xi, . . . ,Xq) be a probability vector with xi < 
X2. Interchanging xi and X2 shows that, due to the interaction with the field, 
the g-tuple (x2, xi, . . . , Xq) has strictly lower free energy than (xi, . . . ,Xq), i.e., 
(xi, . . . , Xq) could not have been a global minimizer. Hence > X2. To rule 
out xi = X2 we note that xi,X2 > and so the gradient of the free energy, 
subject to the constraint xi + X2 = const, must vanish. Hence xie^^^'^~'^ = 
X2t~^^^ which forces Xi ^ X2. The cases /i < are handled similarly. 

(3) This is a consequence of the fact that the gradient V$^^)j vanishes at 
any local extremum in the interior of Conv (Q) . | 

Lemma 5.3. (Monotonicity in 13) FixheR.If(3^ m(/3, h) is a 
differentiable trajectory of local extrema, then 

^^^^,{m{P,h))^-hm{p,h)\'. (5.17) 



d/3 



Proof. The proof is analogous to that of Lemma 5.2(3). | 

The next lemma significantly narrows the list of possible candidates for 
global minimizers: 

Lemma 5.4. (Symmetries of global minimizers) Let ^^^\{m) 

be the mean-field free-energy function. Let m e Conv ^ be a global minimum 
of and let (xi, . . . ,Xq) be the corresponding probability vector of 
barycentric coordinates. 
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(1) lfh>0, then 

Xi > X2 — ■ ■ ■ — Xq. (5.18) 

(2) If h < 0, then . . . , Xg) is a permutation in indices X2, ■ ■ ■ ,Xq of a 
vector with 

Xl < X2 = • • • = Xq-i < Xq. (5.19) 

Proof. The main idea of the proof is that the variables X2,...,Xq, prop- 
erly scaled, behave like a (g — l)-state, zero-field Potts model. Abusing the 
notation slighly, let us write . . . , Xq) instead of $|j')^(m) whenever m 

corresponds to the probability vector (xi, . . . , Xq). In looking for global min- 
ima, we may assume that all x^'s satisfy Xk G (0, 1). Letting 

= A; = 2,...,5, (5.20) 

1 — Xi 

this allows us to write 

...,xq) = {l- a;i)$J;7:i^),o(^2, • • • , + R{x,), (5.21) 

where R{xi) is a function of Xi (and (3 and h). The rest of the proof is based 
on some basic properties of the zero-field Potts free energy for which we refer 
the reader back to Sect. 2.2. 

Let (xi, . . . , Xq) correspond to a global minimum. A principal conclu- 
sion coming from Eq. (5.21) is that the components of the vector (^2, . . . , Xq), 
ordered increasingly, satisfy X2 = ■ ■ ■ = Xq^i < Xg. Using part (2) of 
Lemma 5.2, this immediately implies Eq. (5.19). To prove Eq. (5.18), let 
h > and let (xi, . . . , Xq) be a global minimizer at zero field with maximal 
value of xi. By general facts about the zero-field problem, this forces /?(1 — 
xi) < (3^^^ and, since part (2) of Lemma 5.2 implies that xi > xi, also 

/3(1 — xi) < P^^^. Hence, the variables {z2, ■ ■ ■ ,Zq) correspond to a subcrit- 
ical Potts model and thus Z2 — ■ ■ ■ — Zq. Invoking again Lemma 5.2(2), we 
have Eq. (5.18). | 

5.3. Potts model: Positive fields 

Next we will focus on the cases with h > 0. Our first step is to characterize 
the local and global minima of m i— ^^^\(m) for m restricted to satisfy 
Eq. (5.18). While we could appeal to the "on-axis" formalism from Ref. [11], 
we will keep the requisite calculations more or less self-contained. 

For any probability vector satisfying Eq. (5.18), let us consider the 
parametrization 9 — -^m, where m denotes the scalar magnetization defined 
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via xi = i + m and Xk — ^ — k — 2, . . . ,q. (The physical values of 9 are 

9 e [0, 1].) Let (t>i3,h{(^) denote the value of $^^|j(m) where m corresponds to 
the above {xi, . . . ,Xq). Then we have: 

Lemma 5.5. ("On-axis" minima) The local minima of 9 ^ 
4>p,h{9) are solutions to the equation 9 — f{9), where 

Moreover, let Pq = Then 

(1) For all (5 < (5q and all /i e R, the equation 9 — f{9) has only one 
solution. 

(2) For (3 > (5q there exists an interval {h^,h+) such that 9 — f{9) has 
three distinct solutions once h e {h- , and only one solution for h 

At h = h±, there are two distinct solutions. Once h ^ h±, 
only the extreme solutions ( the largest and the smallest) correspond to 
local minima of 9 i— > <Pi3,h{9). 

Finally, for each j3 > Po, there exists a number hi = hi{l3) G such 
that the global minimizer of 9 ^ 4>/3,h{9) is unique as long as h ^ hi. On 
the other hand, for h — hi there are two distinct global minimizers ( the two 
extreme solutions of9 — f{9)). 

Remark 5.6. Although the above holds as stated in complete gener- 
ality, it is only useful (in the present context) for f3 < /3^p. In particular, 
for (3 > (3^1, while hi((3) continues on taking negative values, it does not 
correspond to any equilibrium commodity. 



Proof of Lemma 5.5. Since the derivative of ^ i-^ 0/3,fe(^) diverges as 6 
tends to either zero or one, all local minima will lie in (0, 1). Differentiating 
with respect to 9 we find that these must satisfy f{9) ~ 9 with / as given 
above. 

In order to characterize the solutions to ^ = /(^)> let us calculate the first 
two derivatives of this function: 

and 

rW = '^' e^.> + ,-l (' - /(")) - + • (5.24) 
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Since we also have f{9) < 1, we find that / is strictly increasing, strictly 
convex for 9 < 9i and strictly concave for 9 > 9i, where 9i is the inflection 
point of /, which is given by 

Q^e+h + 5-1 ^ 2' ^^'^^^ 

i.e., = g — 1. In particular, the derivative f'{9) is maximal at 6' = 9i, 

where it equals f'{9i) = f 

Let us suppose that f'{9i) < 1, which is equivalent to /3 < Pq. Then 
there is only one solution to ^ = /(^)' proving (1) above. Let us now assume 
that f'{9i) > 1. The fact that increasing h amounts to "shifting the graph of / to 
the left" implies that there exists an ho such that 9i solves 9 = f(9) for h = ho. 
Similar arguments show that there exists a unique value /i+ > ho such that the 
diagonal line (at 45°) is tangent to the graph of / at some 9 < 9i, and a similar 
value h^ < ho such that the diagonal line is tangent to the 9 > 9i portion of 
the graph of /. For h E [h-, h^], there are altogether three solutions, labeled 
< < ^u, where f'{9) < 1 at 9 = 9^ 9u while /'(^m) > 1 (with the 
inequalities strict when h ^ h±). 

The "dynamics" of these solutions as h changes is easy to glean from 
the above picture. First is defined for all h < h+ while 6'u is defined for all 
h > h_. Now, as h decreases through h_, the middle and upper 9u solutions 
merge and disappear; and similarly for 9m and 6'l as h increases through h+. 
Only the remaining solution continues to exist in the complementary part of 
the /i-axis. Clearly, both and are continuous and strictly increasing on the 
domain of their definition with — >^ as /i — > — oo and — ^ 1 as /i — > oo. 
Since (t)g.h{9) has local maxima at = and 1, we must have that 9]^ and 6'u 
are local minima and 9m is a local maximum of (These are strict except 
perhaps aih^ h±.) This finishes the proof of (2). 

It remains to prove the existence of the transitional field-strength hi. By 
Lemma 5.4, every global minimizer m i— > ^^p\{m) corresponds to either 
or ^u- Observe that, since and 6'l never enter the portion of the graph of / 
where /' exceeds one, we have 9\j > 9\j{h^) > 9]^{h ) > 9]^ and so the 
difference 6'u — 6'l is uniformly positive. Consequently, the values \ at the 
corresponding magnetizations change at a strictly different rate with h (see 
Lemma 5.2). In particular, there exists a unique point G h+), where 
the status of the global minimizer changes from ^l to ^u- By continuity, dXh — 
hi, both one-sided limits are minimizers of $3 ft. | 

Now we are ready to finish the prove of Theorem 2.3. 

Proof of Theorem 2.3. Most of the claims of the theorem have already 
been proved. Indeed, let hi be as in Lemma 5.5 and let (3 > (3^1. By the 
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properties of the zero-field Potts model, the maximal solution to ^ = /(^) is 
a global minimizer of ^ ^ (f>p,oi^)- It follows that < for /3 > (5^1. 

Invoking also Lemma 5.4(1), we thus conclude that for /3 < /3o or f3 > 
and h > 0, the global minimizer of m i— > $^^)j(m) is unique, while for (3 E 

{Po,Pmp) this is only true when h ^ h\{ji). This establishes parts (2) and 
(3) of the theorem. It thus remains to prove the strict inequality between xi 
and X2 — ■ ■ ■ = Xi in part (1) — the rest follows by Lemma 5.4(1) — and the 
properties of /3 i— > /ii (/3) in part (4). 

(9)1 

mfJ 



First, it is easy to see that hi is continuous. Indeed, let /3' e {(3o, (3^ 



and suppose that (5 ^ hi {(5) has two limit points as /3 — > (5'. By a simple 
compactness argument, there are two distinct minimizers of for h at these 
limit points, which contradicts the uniqueness of hi{(5'). Applying this to (5' — 
/S^^l, we thus have that hi{f3) ^ as (3 ^ (^l^l. 

Second, we claim that /3 i— > /ii(/3) is actually strictly decreasing. To this 
end, let m+(/3) and m_(/3) denote the values of the two global minimizers 
of m H- > $^^jj(m) at h — hi{(3) and let Xi{(3) and Xi{(3) denote the corre- 
sponding first components. From Lemmas 5.2 and 5.3 we can now extract 

d l |m.^(/j)|^-|m-(/;)p 

which the reader will note is the Clausius-Clapeyron relation. Since both xi 
and |m| are increasing with the scalar magnetization, the right hand side is 
negative and so /5 hi{(3) is strictly decreasing. 

Third, we turn our attention to the inequality Xi > X2 — ■ ■ ■ — Xg 
once /i > 0. In light of Eq. (5.18), it suffices to show that, for h > 0, the state 
with equal barycentric coordinates is not a local minimum once h > 0. This is 
directly checked by differentiating Eq. (2.15) subject to appropriate constraints. 
Finally, we will compute the value of h at the end of the line h f3+{h). 
Let 9j^{h) and 9_{h) denote the two distinct (extremal) solutions of f{9) = 9, 
with / as in Eq. (5.22), for P — /3^{h). As h increases, /3+ decreases 
to Po and 9± converge to a single value — the unique solution of f{9) — 9 
at P = Pq. But the inflection point, 9i, is always squeezed between 9^ and 9^, 
and so we must have 9o = 9i. Now the inflection point is characterized 
by e^^'+'' = g — 1 and the equation 9 = f{9) gives us that P+{h) = Po 
ath — he. I 



5.4. Potts model: Negative fields 

The goal of this section is to give the proof of Theorem 2.4. The difficulty here 
is that, on the basis of Eq. (5.19), the full-blown optimization problem is in- 
trinsically two-dimensional. We begin with some lemmas that encapsulate the 
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computational parts of the proof. First we will address the symmetric minima 
by describing the solutions to the "on-axis" equation: 

Lemma 5.7. Let P > and h < and let g: [0, ^] R be the 
function 

pe-h _ 1 

Then g is increasing, concave and satisfies (?(0) > and g{6) < 1. In particu- 
lar, the equation g{0) = has a unique solution on [0, ^]. 

Proof. This is a result of straightforward computations which are not 
entirely dissimilar from those in Eqs. (5.23-5.24). | 

The two-parameter nature of solutions of the form (5.19) will be handled 

by fixing the first barycentric coordinate and optimizing over the remaining 
ones. Here the following property of the resulting "partial minimum" will turn 
out to be very useful: 

Lomma 5.8. Let (3 > /9Mf7^^ and let a be the minimum of Vq and the 
quantity a satisfying f3{l—a) = ^\ For each x G [0, a], let Z2{x), .... Zq{x) 
denote the vector corresponding to the asymmetric minimizerof{z2, . . . , Zg) h-> 
^^i3{i^x) o('^2, ■ ■ ■ , Zq) with Z2 = ■ ■ ■ = Zq-i < Zg. Let ■ip{x) dcnotc the quan- 
tity $^^]j(m) evaluated at m = m{x) where 

m{x) = XYi + (1 - x)z2{x)y2 H ^ (1 - x)Zq{x)Yq. (5.28) 

Then 

^lJ"'{x) < for all x G [0,a]. (5.29) 

Proof. Let tlj{x) be as stated above. Let t = t{x) = — x) and let 
z{x) = {z2{x),. . . , Zg{x)) denote the asymmetric global minimum of o- 
This allows us to rewrite ip{x) as 

ilj{x) — ——x^ + x\og{x) + (1 — x) log(l — x) — hx 

+ (l-^)<to(^(^))- (5.30) 

We will write Z2 = ■ ■ ■ = Zg-i = ^ — and Zg = ^ + m{f), where m{f) 
is the maximal positive solution to 

q-l exp{t|E|m(t)}-l 

-m(t) — , „ 1 — . (5.31) 

q-2 ^> exp{i25im(i)} + g-2 
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The various steps of the proof involve two specific functions u{t) and a{t) 
defined by 

u{t) = t^^rn{t) (5.32) 

and 

gU(t) / gU(t) _ 



a(t) = —- 1 . (5.33) 

^ ^ e«(*) + q-2\ e"(*) + q-2j ^ ' 

We state these definitions here to faciUtate later reference. 

A simple argument gives that t ^ m{t) is smooth when t > /^^jT^^ so 
ip{x) is differentiable. The actual proof then commences by the calculation of 
the third derivative of ^(x): 

rix) = -1 + ^ 



x"^ (1 — x)"^ 

where m' and m" denote the first and second derivative of t t— > m{t) and 
where we have used Lemma 5.3 to differentiate $|V^^- Since we want to show 
ip"'{x) < and we know that x < a < V2, it suffices to prove the inequality 

m'(t) m"(t) /m'(t)\2 ^ _ 
m[t) m[t) V m[t) / 

Differentiating both sides of Eq. (5.31) and solving for m'{t) yields 

m'{t) _ a{t) 
m{t) ~ 1 - ta{t) ■ 



(5.36) 



Taking another derivative with respect to t allows us to express m"{t)/m{t) 
in terms of a{t) and a'{t). In conjunction with Eq. (5.36), this shows that 
Eq. (5.35) is equivalent to 

3 + t^<0. (5.37) 

a[t) 

Differentiating Eq. (5.33) and applying Eqs. (5.32) and (5.36), we have 

a'(t)^a(t)^-^^^^^(l-2—^ Y (5.38) 

Writing Eq. (5.37) back in terms of u{t), we see that Eq. (5.35) is equivalent to 
the inequality 

3(1 _ /to < uitfZ~'^l (5.39) 



Mean-field driven first-order phase transitions 



49 



The rest of the proof is spent on proving Eq. (5.39). 

We first use that x < a impUes t > (3^^^ = 2|5| \og(q — 2) and so the 

left-hand side of Eq. (5.39) increases if we replace t by P^^^^- After this, there 
is no explicit dependence on t and so we may regard the result as an inequality 
for the quantity u. Clearing denominators, substituting s = e", and recalling 
that u{t) > 2 log(g — 2) for a; < a, it suffices to show that 

7(s) = Ags + log s - AMog s - 35^ - SA^ (5.40) 

is strictly positive for all s > A^ and all g > 4, where X — q — 2 and 

A,^3{q-l)pl^'^-6{q-2). (5.41) 

Since /^mf^^^ > 2.5 for g > 4, we easily check that Aq > 10 once g > 4. 

First we will observe that 7 is actually increasing for all s > A^. Indeed, 
a simple calculation shows that, for such s, we have 7'(s) >u;{s), where 

oj{s) = Ag-l + 2s log s - 5s. (5.42) 

Next we find that mins>ocu(s) = Ag - 1 ~ 2q''K Since q''^ ^ 4.48 and 
Aq > 10, we have that u — and hence 7' — are strictly positive for s > A^. 
Hence 7 is increasing for all s of interest. 

Once we know that 7 is increasing, it suffices to show that 7(A^) is posi- 
tive. Here we note that 

7(A') = ^{q- 2)'{(g' - 3g + 6)2 log(g - 2) - 3(g - l)(g - 3)} (5.43) 
g o 



and so 7(A ) is positive once 



21og(,-2)>3(«fi^. (5.44) 

g"' — 3g + 6 



Noting that the right-hand side is less than 3, and using that 2 log 5 > 3, this 
holds trivially for g > 7. In the remaining cases g = 4, 5, 6, the inequality is 
verified by direct calculation. | 

Using Lemma 5.8 we arrive at the following conclusion: 

Corollary 5.9. Let q > 4, (3 > and h < 0. Then has at most 
one (symmetric) global minimizer with Xi < X2 — ■ ■ ■ = Xg and at most one 
(asymmetric) global minimizer with Xi < X2 — ■ ■ ■ — Xq^i < Xg. 

Proof. Let {xi, . . . , Xq) correspond to a minimizer of Since /i < 0, 
Lemma 5.4 allows us to assume that xi < X2 — ■ ■ ■ — Xq-i < Xg. If X2 — 
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■ ■ ■ — Xq, then a simple calculation shows that the quantity 9, which is related 

to Xi via xi = ^ — ^^9, obeys the equation g{6) = 9, where g is as in 
Eq. (5.27). By Lemma 5.7, such a solution is unique and so there is at most 
one symmetric minimizer. 

Next let us assume that Xq exceeds the remaining components. Note that 
we must have that (3{l—xi) > /^mjT^^ because otherwise Eq. (5.21) implies that 
{x2t . . iXq), properly scaled, would correspond to the (g— l)-state Potts model 
in the high-temperature regime. Since in addition xi < Vg, we are permitted 
to use Lemma 5.8 and conclude that xi is a minimizer of the function ^ from 
Eq. (5.30). As is seen from its definition and Eq. (5.29), ijj starts off convex 
(and decreasing) at x — and, as x increases, may eventually turn concave. 
In particular, there could be at most two points in [0, d] where t/j achieves its 
absolute minimum — one in (0, a) and the other at a. 

We claim that if 'tp'^d) < then d cannot be the first coordinate of an 
asymmetric global minimizer. Indeed, if ijj is strictly decreasing at d, then 
the free energy could be lowered by increasing the first component beyond a. 
Therefore, if ^''(5) < 0, then t/j has at most one relevant minimum in [0, a]. 
On the other hand, the above concavity-convexity picture implies that, once 
^'{d) > 0, there is only one point in [0, d] where ip is minimized. Hence, in all 
cases, there is at most one asymmetric minimizer. | 

The proof of Theorem 3.5 will require some comparisons between the two 
minimizers allowed by Corollary 5.9. These are stated in the following lemma. 

Lemma 5.10. Let q> 4, (3 > and h G (-oo, 0). Suppose that $^'^1^ 

(S) (S) (S) 

has two minimizers, one symmetric with x\ < = ■ ■ ■ — Xq and the other 
asymmetric with a;^'^^ < Xg^'' — • • ■ — < x'^^\ Then 

xf^ < xf^ and xf < xf\ (5.45) 

Moreover, let Ca = [x[^\^ + ■ ■ ■ + [xf and es = [xf\^ + ■■■+ [xf\^. Then 
there exists a constant > such that for any h € [— oo, 0) and any (3 > 

where both minimizes 0/$^'^)^ "coexist," we have 

ca - es > C5. (5.46) 

Both parts of this lemma are based on the following fact. Let (xi, . . . , Xq) 
minimizer of \ orde 
stationarity condition yields 

r^-^^-^x^-h ^ X2e~^''^ = . . . = x^e-^^^ (5.47) 



be a minimizer of \ ordered such that Xi < X2 — ■ ■ ■ — Xq^i < Xq. The 



and so let © denote the common value of this equality. Then we have: 
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Lemma 5.1 1 . Let h < and /3 > 0. IfQ and 0' correspond to two 
minimizers of and © = then the minimizers are the same ( up to 
permutations in the last q — 1 indices). 

Proof. Suppose that both minimizers are ordered increasingly. By h < 
and Lemma 5.2, Xi < X2 = ■ ■ ■ = Xq^i. The fact that {jz^, • • • , is the 

minimizer of ^^p^]^^^^ ^ — see Eq. (5.21) — then implies I3xk < 1 for all k = 
1, . . . , g — 1. Since the function r{x) — xe~^^ is invertible for x with Px < 1, 
equality of the O's implies equality of the first q — 1 coordinates. The constraint 
on the total sum implies equality of the Xq's as well. | 

Proof of Lemma 5. 10. We will first attend to the proof of Eq. (5.45). 
In light of Eq. (5.21), the {q — l)-state Potts system on {x2, . . . ,Xq) is at 
the effective temperature (3^^ — (1 — xf'')(3 for the symmetric minimizer 
and f3^Q^ = (1 — for the asymmetric minimizer. But for both symmetric 

and asymmetric minimizers to "coexist" we must have (3^^ < (3^f^^ ^ /^etr^ 
and so a;|^^^ < xf\ To rule out the equality sign, we note that if x^^'^ — xf\ 
then the corresponding 0's are the same and Lemma 5.1 1 thus forces equality 
of all components. Once l3^^^ < (3^^^ is known, xi^^ < xif'^ follows. 

In order to prove Eq. (5.46), let be the common value of \ for the 
two minimizers and let ©a and ©s be the corresponding ©'s. Let us take the 
logarithm of every term in (5.47), multiply the result for the j-th term by xj 
and add these all up to get 

- ^es = log ©s and - ^ca = log ©a- (5.48) 

As x[^^ < xf^ < V/3, we have ©a < ©s for all h e {—oo, 0); for h = 0, — oo 
this holds by a direct argument for the zero-field Potts model. Hence es < ba 
whenever the two minimizers are "coexist." 

To see that the positivity of ca — es holds uniformly in (h, (3) E [— oo, 0] x 
[0, oo], we use a compactness argument. First, we only need to worry about 
the /3's in a finite, closed interval Iq. Indeed, the effective temperature of the 
Potts model, /3eff = — xi), is a number between (3 and (3{1 — Vq) and so if 
either (3 < (3^^^ or (3(1 — V?) > Pmf^^^ ^^^^ coexistence of minimizers is 
possible. 

Next let us consider a sequence of (h, (3) in [— oo, 0] x Iq with a topology 
that makes this set compact. If ca — es tends to zero along this sequence, 
the above arguments imply that the asymmetric and symmetric minimizers 
must coalesce as the parameters tend to a limiting point. But this is impos- 
sible because by the second half of Eq. (2.17), the scalar magnetization of the 
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corresponding {q — 1) -state Potts model, which is proportional to the ratio of 
Xg^^ — ^2'^^ and 1 — x^^\ is always at least |5f • I 

Remark 5.1 2. The previous proof kept the distinctness of es and ca in 
the realm of the existential. A calculation actually shows that, for any h < Q, 
there are constants ei < 62 depending only on q such that es < ei and ca > 62 
whenever the two minimizers "coexist." 

Proof of Tfieorem 2.4. Fix /? > and h < 0. Corollary 5.9 implies 
that, up to a permutation in all-but-the-first component, ^^g \ has at most two 
global minimizers: one symmetric ms and one asymetric niA. This proves 
part (1) of the theorem. 

Among the global minima, the first barycentric coordinate Xi — h) 
is (strictly) increasing in h (see Lemma 5.2) and so the effective coupling 
Pes{h) = P{1 — xi{(3,h)), which governs the (q — l)-state Potts model 
on {x2, ■ ■ ■ , Xq), is decreasing. Now if (3sf{{h) > f^l^^ then only the asym- 
metric minimum is relevant, while if f3{h) < (3^^^ then only the symmetric 

minimum applies. Hence, for (3 G (/^miT^^ /^mf)' there is a unique /12 = ^2(/5) 
such that the role of minimizers changes as h increases through /i2- (For P 
outside P^), the minimizers are in qualitative agreement with those 

of h = — cxDor/i = 0~.) In particular, the minimizer is unique for h ^ h2{P) 
and both minimizers "coexist" for h = h2{P). 

Modulo the definition of function parts (2-4) of the theorem are 
proved. It remains to show that P 1— > h2{P) is strictly increasing (and thus 
invertible), continuous and with limits —00 and at the left and right end- 
points of il3^^\ P^), respectively. By Lemma 5.10, the quantities es and ca 
are separated by a "gap." A simple limiting argument (not dissimilar to that 
used in the proof of Theorem 2.3) now shows that h2 is continuous. Moreover, 
by Lemma 5.3, the norm-squared of all minimizers increases with f], and so /12 
is strictly monotone and the limits of /i2 at the endpoints of {Pmf^\(^mI) must 
be as stated. These facts allow us to define as the inverse of /i2 and verify 
all its properties in part (5) of the theorem. | 

6. PROOFS: ACTUAL SYSTEMS 

Here we will provide the proofs of our results for actual spin systems. The 
main portion of the arguments has already been given in Sects. 4 and 5. We 
will draw freely on the notation from these sections. The proofs are fairly 
straightforward (and mostly existential) and so we will stay rather brief. 

First we will attend to the zero-field Potts model: 
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Proof of Theorem 3.4. The proof is more or less identical to that of 
Theorem 2.1 of Ref. [11]; the only substantial difference is that now we are 
not permitted to assume that the magnetization is monotone (indeed, some of 
the Jx,y's, may be negative). We will base our arguments on the mean-field 
properties of the zero-field Potts model, as outlined in Sect. 2.2. 

Recall the mean-field free-energy function $^*q from Eq. (2.15). By the 

fact that the global minimizer of $^^q changes from symmetric to asymmetric 
as /3 increases through /3^p, we can make the following conclusions: Given /3 
P^, let Ue be an e-neighborhood of m = and let be the union of e- 
neighborhoods of the asymmetric minimizers. Then for each e > 0, there 
exists 5>0 such that for all (3 with \(3 - pj^l < ethe set 

Os^{me Conv (Q) : $J,'J,(m) - Fmf(/9, 0) < 5} (6.1) 

is contained in U V^. Moreover, if j3 — — e, then Os C Ue while 
at /? = pI^I + e, we have Os C V,. 

Let 0) be the set of "extremal magnetizations." By Theorem 3.2, if 

the integral in Eq. (3.9) is so small that /3f n = P^J^ < S for all /3 with 

P < Pmf + then C Os- Now the asymmetric minimizers have norm at 
least V2, and the near-monotonicity of the magnetization from Lemma 4.9 thus 
implies that, at some /?t with |/?t — /3^p| < e, the physical magnetization jumps 
from some value inside to some value inside Ve. The jump (of this size) is 
unique by Lemma 4.9. From here the claims (3.17-3.19) follow. | 

Next we dismiss the cases with non-zero field: 

Proof of Theorem 3.5. Let be the quantity from Theorem 2.3 
and Pulih) be the concatenation of functions /5+ and /5_ from Theorems 2.3 
and 2.4. An argument similar to the one used in the previous proof shows that, 
for each e > there exists 5 > 0, such that if P^^^nJ' < 5 and h < — e, a 
strong first-order transition occurs at some Pt{h) which is within e of P^{h). 
This transition is manifested by a jump in both magnetization and energy den- 
sity. This proves part (1) of the theorem. 

As to part (2), by Lemma 5.10 we know that the first components of the 
two minimizers are uniformly separated whenever h is confined to a compact 
subset of (— 00, /ic)- Since our general bounds in Theorem 3.2 imply that the 
physical magnetizations at [h, pt{h)) are very near their mean-field values pro- 
vided ..y is sufficiently small, also the first components thereof must be dif- 
ferent. Using the monotonicity of the first component of physical minimizers 
in h, the existence of a jump in mi,{P, h) on the transition line follows. | 

Proof of Theorem 3.7. The proof is based on Theorem 3.6 and 
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Lemma 5.1. Indeed, Theorem 3.6 implies that all minima are characterized 
by the fact that one of {xi,Xo, is larger than 1 — Ce~^. These minima 
are nearly degenerate for A of order e^'^ with free energy difference given 
by A — e~^ + 0(/9e~^). The goal is to show that the free energy is uniformly 
large (on the scale of e~^ in the complement of the Ce~^-neighborhood of 
these minima. 

Let C » 1 be the number exceeding the corresponding constant from 
Theorem 3.6 and suppose that |A| < Ce~^. Consider the set Og of all 
triplets {xi,xq,x_i) with xi -\- xq + a;_i = 1, such that max{xi, a;o, > 
1 — Ce~'^. We claim that for P > Po (with Pq depending on C), 

inf ^p,x{xi,xo,X-i)>a{ClogC)Q-'^, (6.2) 

where a is a positive number independent of C. Indeed, Theorem 3.6 implies 
that all local minima of $^ a lie in Oj^, and so the absolute minimum of ^p^x 
must occur on the boundary of C^. But the "outer" boundary of is not a 
possibility, and so the mimimum occurs at a point with max{a;i, Xq, x_i} = 
1 — Ce~^. The bound (6.2) is then a consequence of Lemma 5.1. 

Let now the integral in Eq. (3.9) be such that pJ^ -C (ClogC)e~''. 
Then Theorem 3.2 ensures that all physical magnetizations (from ^^,) are con- 
tained inside Op. However, by Eq. (3.23), for P such that A — e.~^ > 0{Pe.~f^) 
the set 0/3 contains no triplets with dominant x±i while for A — < 
0{Pq~^), there are no -dominant states. The standard thermodynamic ar- 
guments imply that the amount of zero-ness decreases as A increases. Hence, 
there must be a jump at some At = + 0(/3e~'^) from states dominated 
by O's to those where O's are very sparse. This finishes the proof. | 
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